EDITORIAL


THESIS ABSTRACTS 


Since 1938, this Journal has regularly published abstracts of appropriate
theses which have been accepted for higher qualifications in psychology and 
education. In recent years, however, in spite of the increased output of theses, 
these have come mainly from a very small number of universities. This note is 
included to remind Heads of Departments of Psychology and of Institutes and 
Departments of Education of the service which is available. 


Thesis abstracts for consideration should be forwarded to the Editor by 
the Head of the Department concerned and may be intended either as the sole 
published account of the thesis or as a preliminary report of a research which is 
intended to be published at length at some future date. It is essential that they 
should be abstracts, not short articles, and Heads of Departments are specially 
asked to sub-edit them to make them as clear as possible consistent with 
brevity. Normally, abstracts of more than 1,500 words will not be considered 
and only exceptionally will tables be allowed. 


Proofs will normally be returned to the Department of origin for correction. 


CORRECTION 


The heading of the three papers on researches using the Maudsley Personality
Inventories in the last issue of the Journal (p. 241) should have been
“Three Experiments using Maudsley Personality Inventories,” not “Children's
Personality Inventories.” Savage used the M.P.I. in his research.


Dr. Warburton has also pointed out that the statement by Callard and 
Goodfellow that the J.M.P.I. is the only test of this kind available for English 
children, is not strictly true. Cattell’s High School Personality Questionnaire 
(H.S.P.Q.) for ages 12-16 and his Children’s Personality Questionnaire (C.P.Q.) 
for ages 8-12, are available in this country. Although they were standardised in 
the United States they have been used in Manchester and have been found 
suitable for use with English children.
A FURTHER STUDY OF THE EDUCATIONAL PROGRESS
OF CHILDREN WHO HAD RECEIVED REMEDIAL 
EDUCATION 


By K. LOVELL, C. BYRNE and B. RICHARDSON
(University of Leeds) 


Summary. Children who had received full-time teaching in Remedial Centres
of an English education authority were traced, and their reading ages assessed
along with their understanding of certain arithmetic concepts. A summary
of the data obtained is given together with the inter-relationships found between
a number of the relevant variables.


I.— INTRODUCTION. 


In an earlier paper (Lovell et al., 1962) some details were given relating to the
reading ages of children who had received remedial teaching and were subse- 
quently followed-up. The pupils concerned in that study had received one 
period of remedial teaching per week, each period lasting 45 minutes. In the 
present study junior school children in a different education authority, who had
been sent to full-time remedial centres for a period, were followed-up and their 
reading ages assessed. Furthermore, these children also undertook a test of 
arithmetic concepts at follow-up. 


It must be clearly stated at the outset that this investigation throws no 
light on whether the children who attended these centres became better adjusted 
to school work and life, or whether their general motivation and morale 
improved under full-time remedial teaching. But the study does throw light 
on the likely educational progress and eventual performance, during school 
years, of children considered to be in need of remedial teaching. Furthermore, 
it is now possible to compare the progress of these children with those in the
earlier study who had remedial teaching for only one period a week, and it has 
been possible to throw some further light on the question as to whether, in the 
long run, children who receive remedial teaching in special centres make more 
progress than those who stay in their own schools. 


II.—SAMPLE STUDIED AND PROCEDURE.


The local education authority had five full-time remedial centres and the 
teachers-in-charge visited the local junior schools and picked out children, 
using intelligence and reading tests, who were considered to be in need of 
remedial teaching. Not all children so picked out attended the centres. Some- 
times, the head of a junior school did not regard a particular child as suitable, 
and some parents refused permission for their children to attend the centres in 
lieu of school. In general, those who attended the centres were deemed to be 
the most likely to make most improvement under special teaching. 


An examination of the records at the centres yielded complete data on 261
children who had attended full-time, and who had returned to ordinary schools 
not later than July, 1960. The follow-up study was done during the period 
November, 1961—March, 1962, so that no child was included in the survey 
unless he had been back at ordinary school for at least sixteen months. 
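The sixteen-month minimum follows from the two dates just given. A minimal sketch of that arithmetic, assuming whole calendar months are counted and the day of the month is ignored (the function name is illustrative, not from the study):

```python
from datetime import date


def whole_months_between(start: date, end: date) -> int:
    """Count calendar months from start to end, ignoring the day of the month."""
    return (end.year - start.year) * 12 + (end.month - start.month)


# Dates taken from the text; the day of the month is an arbitrary placeholder.
latest_return = date(1960, 7, 1)     # latest return to ordinary school
follow_up_start = date(1961, 11, 1)  # first month of the follow-up period
follow_up_end = date(1962, 3, 1)     # last month of the follow-up period

# Shortest possible interval for any child in the survey.
min_gap = whole_months_between(latest_return, follow_up_start)
# Interval for a child who returned in July, 1960 and was seen in March, 1962.
late_gap = whole_months_between(latest_return, follow_up_end)

print(min_gap, late_gap)
```

A child who returned in July, 1960 and was tested at the very start of the follow-up period therefore sets the lower bound; every other child had been back at school for longer.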
The subjects were then divided up into those who were backward and those
who were not, the former group being defined as those pupils whose reading ages 
(on whatever test was used at the time of referral) were less than 85 per cent. of 
their chronological ages. There were thus found to be 184 children classified as
backward, and 77 as not backward. It will be quite obvious at this point that 
some children had been selected for attendance at these centres on the grounds 
of a discrepancy between reading age and mental age. Of the backward children, 
nine had moved to other parts of the country, one was in hospital, four were 
absent from school during the follow-up period, and six could not be traced by 
the authority, giving 164 backward children available for study. Of the non- 
backward group only one child was not found (she had moved to a private 
school outside the authority’s area) giving 76 pupils in this category available 
for investigation. 
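The classification rule and the sample accounting above can be sketched as follows. The reading quotient is assumed here to be 100 × reading age ÷ chronological age, which matches the 85 per cent. criterion stated in the text; the two example children are hypothetical, and only the counts are taken from the paragraph.

```python
def reading_quotient(reading_age: float, chronological_age: float) -> float:
    """Reading age expressed as a percentage of chronological age (assumed definition)."""
    return 100.0 * reading_age / chronological_age


def is_backward(reading_age: float, chronological_age: float, cutoff: float = 85.0) -> bool:
    """Backward = reading age below 85 per cent. of chronological age."""
    return reading_quotient(reading_age, chronological_age) < cutoff


# Two hypothetical children (ages in years), for illustration only.
print(is_backward(6.5, 9.0))   # quotient about 72, so classified as backward
print(is_backward(8.0, 8.5))   # quotient about 94, so not backward

# Sample accounting, using the counts given in the text.
backward_classified = 184
backward_lost = {"moved away": 9, "in hospital": 1, "absent": 4, "untraced": 6}
backward_available = backward_classified - sum(backward_lost.values())

non_backward_classified = 77
non_backward_available = non_backward_classified - 1  # one child moved to a private school

print(backward_available, non_backward_available)
```

The two totals agree with the group sizes of 164 and 76 used in the rest of the paper.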


The teachers-in-charge of the centres were free to use intelligence or reading 
tests (often the N.F.E.R. Sentence Reading Test) of their own choice in selecting
children for the centres. Of the backward children fifty-nine had taken the
Moray House Picture Test 1, fifty-four the Intelligence Tests for Young Children
(Valentine), twenty-five the Sleight Non-Verbal Intelligence Test, fourteen the
N.F.E.R. Non-Verbal Test 5, and twelve the Terman-Merrill revision of the
Binet test. Again, either just before or immediately after admission, 142 subjects 
had taken the Schonell Graded Vocabulary Test R1 and twenty-two the Holborn 
Reading Scale for Infants and Juniors; while at the time of return to school, 
102 had been given the Schonell Graded Vocabulary Test R1, and sixty-two the 
Holborn Reading Scale. 

In the non-backward group forty-eight had taken the Moray House Picture 
Test 1, eleven the Intelligence Tests for Young Children (Valentine), three the 
Sleight Non-Verbal test, thirteen the N.F.E.R. Non-Verbal Test 5, and one the 
Terman-Merrill revision of the Binet test. Just before or immediately after
admission to the centres, sixty-two children had their reading ages assessed
using the Schonell Graded Vocabulary Test R1 and fourteen the Holborn
Reading Scale, while on return to school the numbers for these two tests were 
respectively fifty-seven and nineteen. 


On follow-up, the children were found in over fifty schools. Both the 
Schonell Graded Vocabulary Test R1, and the Vernon Graded Word Reading 
Test were given to assess the reading ages. Moreover, each pupil undertook, 
individually, a test which involved the understanding of certain arithmetic 
concepts devised by one of the authors (K.L.). The questions, which were
duplicated, were presented to the subjects. The tester read aloud a question,
and the child followed on the paper. The child could answer either orally or in
writing. No time limit was imposed. When the tester was satisfied that the
child had answered the question to the best of his ability, he
passed on to the next question. The percentage numbers of children responding
correctly to some of the least well answered questions are shown in Table 4.


In order to have a further check on educational progress, figures will be 
presented showing the numbers of children in the bottom stream, next from 
bottom stream, etc., of multi-stream schools, on follow-up (Table 5). 


At one centre initial data were obtained on fifty children who, on the basis 
of a discrepancy between R.A. and M.A., had been regarded as being suitable for 
attendance at the centre, but who, for the kind of reasons already explained, did 
not, in fact, attend. Such children were left in ordinary schools and will be 
referred to as the control group. All children who were picked as likely to be
suitable for this particular centre had taken the N.F.E.R. Sentence Reading
Test ; in addition, other reading tests were given to children when they actually 
joined the centre. The control group was matched with pupils who went to the 
centre for chronological age, reading quotient (N.F.E.R. Sentence Reading 
Test), sex, and, as far as possible, intelligence test score and school; and this 
matched group which attended the centre will be referred to as the experimental 
group. While the matching procedure was not perfect, it can be said with 
certainty that the teachers of the control group had no idea whatever, that 
some years later these children would be followed up and retested. Moreover, 
these fifty children in the experimental group were drawn from fifteen schools 
so that the classes out of which they were taken were but little reduced in 
size. At follow-up, then, the control group was also traced and the two reading
tests given (see Table 6).


III.—RESULTS.
The data obtained have been given in tables for the sake of clarity. 


TABLE 1


DATA RELATING TO THE PERIOD OF REMEDIAL TEACHING.


Variable                                  Backward Group           Non-Backward Group
                                          N=164                    N=76

[I.Q.]                                    M=98.8, σ=12.8           M=111.3, σ=13.3
[R.A. (in years) at time of referral]     M=6.4, σ [illegible]     M=8.2, σ=1.0
C.A. (in years) at time of referral       M=9.2, σ=0.9             M=8.7, σ=0.7
Mean Reading Quotient                     70                       94
R.A. (in years) at end of remedial
  teaching                                M=8.4, σ=1.4             M=10.0, σ=1.3
C.A. (in years) at end of remedial
  teaching                                M=10.4, σ=1.0            M=9.6, σ=0.8
Mean Reading Quotient                     81                       104
Mean length of period of remedial
  teaching (years)                        [illegible]              [illegible]
Extremes (in years) in period of
  remedial teaching                       [illegible]              [illegible]
Mean improvement in R.A. during
  remedial teaching (in years)            [illegible]              [illegible]
Extremes (in years) of improvement
  during remedial teaching                [illegible]              [illegible]
TABLE 2


DATA RELATING TO THE PERIOD SINCE REMEDIAL TEACHING CEASED.


Variable                                  Backward                 Non-Backward
                                          N=164                    N=76

R.A. (in years) at follow-up
  (Schonell R1)                           M=10.0, σ=1.8            M=11.4, σ=1.2
R.A. (in years) at follow-up
  (Vernon Graded Word)                    M=9.8, σ=2.0             M=11.4, σ=1.5
C.A. (in years) at follow-up              M=13.1, σ=1.6            M=12.1, σ=1.3
Reading Quotient (R.A. from
  Schonell R1)                            76                       94
Mean progress (in years) since
  remedial teaching ceased                1.6                      1.4
Mean length of time in years
  between end of remedial teaching
  and follow-up                           2.7                      2.5
Extremes of progress made (in
  years) since remedial teaching
  ended:
    12-23 months previously               -0.8 to 4.4              -0.5 to 2.4
                                          M=1.2                    M=0.7
                                          N=64                     N=28
    24-35 months previously               -0.4 to 4.5              -0.6 to 4.1
                                          M=1.7                    M=1.8
                                          N=42                     N=30
    36-47 months previously               0.5 to 3.2               1.0 to 3.8
                                          M=2.0                    M=2.2
                                          N=31                     N=12
    48+ months previously                 -1.8 to 5.8              0.4 to 2.3
                                          M=1.9                    M=1.1
                                          N=27                     N=6

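As a consistency check on Table 2, the overall mean progress in each group can be recovered as the size-weighted mean of the four time-band means. The sketch below uses the N and M values as read from the table; the function name is illustrative, and the weighting by subgroup size is the only assumption.

```python
def pooled_mean(groups):
    """groups: list of (n, mean) pairs. Returns (total n, size-weighted mean)."""
    total_n = sum(n for n, _ in groups)
    weighted = sum(n * m for n, m in groups) / total_n
    return total_n, weighted


# (N, mean progress in years) for the 12-23, 24-35, 36-47 and 48+ month bands.
backward = [(64, 1.2), (42, 1.7), (31, 2.0), (27, 1.9)]
non_backward = [(28, 0.7), (30, 1.8), (12, 2.2), (6, 1.1)]

for label, groups in (("Backward", backward), ("Non-Backward", non_backward)):
    n, mean = pooled_mean(groups)
    print(label, n, round(mean, 1))
```

The subgroup sizes sum to 164 and 76, and the pooled means round to the 1.6 and 1.4 years reported for the two groups.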
TABLE 3


DISTRIBUTION OF READING AGES (SCHONELL R1) AT FOLLOW-UP.


                                        Reading Age in Years
                 Up to   5.5-    6.5-   7.5-   8.5-   9.5-   10.5-  11.5-  12.5-  13.5-
                 5.4     6.4     7.4    8.4    9.4    10.4   11.4   12.4   13.4   14.4   Total

Backward      [illegible] [illegible] 10   24     28     26     35     24     11     2      164
Non-Backward     [0]     [0]     0      1      4      6      28     24     8      5      76
TABLE 4


PERCENTAGE NUMBERS OF CHILDREN GIVING CORRECT REPLIES TO SELECTED QUESTIONS
IN THE TEST OF ARITHMETIC CONCEPTS.


1.—In the number 26 what does the figure 6 stand
    for (worth)? Or, What does it tell us?
2.—In the number 26 what does the figure 2 stand
    for? Or, What does it tell us?
3.—In the number 404, what does the middle figure [...]
4.—In the number 404 how many times bigger is the
    4 with a star than the 4 with a cross?
5.—What number is half way between 26 and 32?
6.—Write down the answer to this exercise without
    actually multiplying the numbers:
    9 × 87 divided by 9
6a.—Explain how you obtained the answer you have
    just given (in 6). Or, How did you get that?




TABLE 5


PERCENTAGE NUMBERS OF CHILDREN IN GIVEN STREAMS OF MULTI-STREAM PRIMARY,
SECONDARY MODERN OR ALL AGE SCHOOL AT FOLLOW-UP.


                                  Multi-Stream School
                 Bottom     Next to Bottom Stream in     Top
                 Stream     3 or more Stream School      Stream

[Backward]       16    18    [remaining entries illegible]
Non-Backward     20    [illegible]    13





TABLE 6

DATA RELATING TO THE EXPERIMENTAL AND CONTROL GROUPS: 50 IN EACH GROUP.


Variable                                  Experimental Group       [Control Group]

[I.Q.]                                    M=116, σ=10.0            M=112, σ=10.6
[C.A. (in years) when the experimental]
  group first joined the Centre           M=7.9                    M=7.9
Reading Quotient (N.F.E.R.
  Sentence Reading)                       M=85                     M=85
Length of time (in years) at Centre       M=0.9
R.A. (in years) at follow-up
  (Schonell R1)                           M=10.7, σ=1.2            M=10.8, σ=1.3
[R.A. (in years) at follow-up
  (Vernon Graded Word)]                   M=10.5, σ=1.5            M=10.7, σ=1.5
C.A. (in years) at follow-up              M=11.3                   M=11.3
Reading Quotient (Schonell R1)            94                       96
Mean length of period (in years)
  since the experimental group
  first joined the Centre                 M=3.4                    M=3.4
TABLE 7


INTER-RELATIONSHIPS BETWEEN RELEVANT VARIABLES USING KENDALL'S TAU
COEFFICIENT.


Relationship                                          Backward     Non-Backward
                                                      Group        Group

I.Q. and progress made during remedial teaching       .22**        .[?]5*
I.Q. and R.A. (Schonell R1) at follow-up              .37**        .10
Length of remedial teaching and progress made
  during remedial period                              .3[?]**      .35**
Progress made since remedial teaching ceased and
  time elapsed since                                  .27**        .30**
Progress made during remedial teaching and R.A. at
  follow-up                                           .37**        .4[?]**
I.Q. and progress during remedial teaching holding
  constant the length of remedial teaching            .20†         .16†


* Indicates significance at 5 per cent. level.
** Indicates significance at 1 per cent. level.
† It is not possible to test the significance of τxy.z.
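The last row of Table 7 is a partial rank correlation. A minimal sketch of how such a value can be computed, assuming the standard Kendall partial tau formula, τxy.z = (τxy − τxz·τyz) / √((1 − τxz²)(1 − τyz²)), and the tau-a form of the coefficient (the paper does not say which variant was used). The data below are hypothetical and serve only to exercise the functions.

```python
from itertools import combinations
from math import sqrt


def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) pairs over all pairs; ties count as neither."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
    n = len(x)
    return (concordant - discordant) / (n * (n - 1) / 2)


def partial_tau(t_xy, t_xz, t_yz):
    """Kendall's partial tau of x and y with z held constant."""
    return (t_xy - t_xz * t_yz) / sqrt((1 - t_xz ** 2) * (1 - t_yz ** 2))


# Hypothetical scores for six children (not data from the study).
iq = [88, 94, 97, 101, 106, 112]
progress = [0.8, 1.5, 1.1, 1.9, 1.6, 2.4]   # gain in R.A. during remedial teaching
length = [0.6, 1.2, 0.7, 1.0, 1.4, 1.1]     # years of remedial teaching

t_xy = kendall_tau_a(iq, progress)
t_xz = kendall_tau_a(iq, length)
t_yz = kendall_tau_a(progress, length)
print(round(partial_tau(t_xy, t_xz, t_yz), 2))
```

No significance level is attached to the result here, in line with the footnote to the table.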


IV.—DISCUSSION.


The results may be conveniently discussed under a number of headings :


(1) The data relating to the backward group (which has a mean I.Q. close
to 100) during the period of remedial teaching show much the same general
trends as the corresponding data in the earlier study (Lovell et al., op. cit.). There
is a rise in the reading quotients of the same order of magnitude in both studies,
and overall, we may say that as far as reading ages are concerned, the children
in the earlier study, who had had remedial teaching for only one period a week,
made as much progress per unit time as those now under discussion.
Furthermore, the progress made in reading per unit time since remedial teaching 
ceased is almost identical in both investigations. 

(2) There was a tendency in the previous investigation for the reading quotients
to fall between the end of treatment and follow-up. The same trend is clearly
noticeable in both backward and non-backward groups in this study.


(3) Table 4 shows that only some 40 per cent. of the backward group
had an understanding of place value involving tens and units at follow-up, and 
far fewer could fully appreciate place value in relation to hundreds, tens and 
units. This is a serious matter since an understanding of place value is funda- 
mental in our number system. 


(4) Over three-quarters of the children who were backward at time of 
joining the centres, were found at follow-up, in the bottom or next to bottom 
stream of a three or more stream school. A further 16 per cent. were in single
stream schools, and only 5 per cent. in a top stream. This clearly confirms the
view expressed in the earlier paper that only very few children, backward
at 8½-9 years of age, are later found among the academically abler pupils.
Not a single child of secondary school age in this group was found in a grammar
school at follow-up.
(5) The findings given in Table 6 came somewhat as a surprise. After a
mean period of almost 3½ years from the time of referral, no difference in the
mean reading ages could be found between those who did, and those who did not
attend the remedial centres. While the matching procedure could not be perfect,
it was done with great care. Moreover, those who went to the centres were 
regarded as the most promising pupils. It is still possible, of course, that up to 
one year after referral, the mean reading age of those attending the centres 
might have been ahead of the mean reading age of those in school. 


(6) The non-backward group (mean I.Q. 111) made about the same 
progress per unit time in reading as the backward group during the period of 
remedial teaching. The reading quotient rose but fell markedly again in the 
period after remedial teaching ended so that at follow-up the quotient was 
roughly the same as at the time of referral. 


(7) In the arithmetic test some two-thirds of the non-backward group
understood place value in relation to tens and units, but the more difficult 
questions relating to hundreds, tens and units remain poorly answered. 


(8) In spite of the high mean I.Q. of the non-backward group, nearly half 
of them were found in the bottom stream, or next to bottom stream of a three or
more stream school at follow-up. One in five of the group was found in a single
stream school, and roughly one in seven in the top stream of a multi-stream
school. But of those of secondary school age at follow-up, not a single child
was found in a grammar school. 


(9) It must be stressed that although the progress made by the majority 
of the children involved in this study was limited, the progress made was of 
great importance for the pupils concerned. To attain a reading age of 9+ years 
is very real progress for the individual.

(10) The correlation coefficient between I.Q. and progress made during 
treatment (Table 7) is small but significant in the backward and non-backward 
group. This clearly parallels the finding in the earlier study. 


(11) The previous study and this one, which together involved the follow-up 
of five hundred children, compared children taught singly, and in small groups, 
for one period per week, with children of somewhat higher I.Q. taught full-time
in remedial centres. Whatever arrangements we have, the long-term results 
seem to be much the same. This surely suggests that we know very little as 
yet about the aetiology of scholastic failure and indicates the great need for 
research. 


ACKNOWLEDGEMENTS.—Our thanks are due to the Chief Education Officer 
and the Senior Educational Psychologist of the Local Education Authority 
concerned for their generous help in conducting the enquiry and for permission 
to publish our findings. 
DEVELOPMENTAL PATTERNS OF SEVERELY SUBNORMAL CHILDREN


By MARY WOODWARD* and DIANA J. STERN†
(Fountain Hospital, London) 


Summary. Eighty-three severely subnormal children aged under 9 years were 
observed with the aim of comparing their development in different areas and of 
examining the relation of various aspects of development to Piaget's sensori-motor
stages. It was found that locomotor development was significantly in
advance of speech development. Children classified at various sensori-motor
stages differed significantly in speech and in locomotor development. Certain
developments in language, drawing, performance ability and social responses 
were associated with the attainment of the last sensori-motor stage, thus 
suggesting that the achievement of the end of the sensori-motor period is an 
important step in the development of severely subnormal children. 


I.—INTRODUCTION.

A PREVIOUS study (Woodward, 1959) investigated a method of classifying the 
severely subnormal in terms of the six stages of sensori-motor development 
which have been distinguished by Piaget (1953 and 1955). The results indicated 
that severely subnormal children pass through these steps in the same sequence
as Piaget suggests for normal children up to the age of two years. The aim of 
the present enquiry was (i) to determine the locomotor, language and social 
development of children classified at these sensori-motor stages, and (ii) to 
examine patterns of development in young severely subnormal children. 

The main theoretical interest attaches to the comparison of language 
development and the Piaget stages. At the last sensori-motor stage, children 
solve problems in such a way as to suggest that they are able to anticipate the 
future position of an object before they move it, and they persistently search 
for hidden objects in a manner which suggests that they are aware of the perm- 
anence of objects. According to Piaget (1950), this behaviour indicates that the 
child has some means of representing objects to himself in their absence, and 
this implies the development of ‘a symbolic function’ which makes possible
the systematic acquisition of language. This suggests the hypothesis that 
severely subnormal children will make a similar development in language at the 
end of the sensori-motor period. 

II.—SUBJECTS.

The subjects were resident at the Fountain Hospital, London. The total 
hospital population born during 1951 or later was selected in the first instance 
as the group to be investigated. Children were then excluded from this group 
if they had any condition, revealed in the medical examination, such as 
cerebral palsy, gross hydrocephalus, or a visual or auditory defect, that is likely 
to retard any aspect of development. In addition, two children with I.Qs. of
over 50 were omitted from the study. This left a group of fifty-four children. 
During the following eighteen months twenty-nine new admissions were added 
to the group ; these were consecutive cases without motor or sensory disabilities. 
This yielded a total group of eighty-three children (fifty-one boys and thirty-two
girls), aged 11 months to 8 years 7 months (mean age=4.85 years). Three
children had Terman-Merrill I.Qs. of 42, 41 and 31 ; the rest failed one or more 
of the six two-year items on this scale (forty-nine failed all six), thus falling 
below the lower limit of the scale. 


* Now at University College, Swansea. 
† Now at Child Guidance Clinic, Bexleyheath, Kent.




III.—METHOD.


The main classification was made in terms of Piaget's six sensori-motor 
stages. Since these stages have been described in detail in the earlier study 
already referred to, only the main criteria are given below. 


Stage I (birth to 4/5 weeks). 
Use and extension of reflex activities. 


Stage II (1 month to 4/5 months).
Integration of reflex activities with higher activities, e.g., co-ordination 
of vision and hearing and vision and grasping. 


Stage III (4/5 months to 8/9 months). 

Repetition in the same form of chance-made movements that produced an 
effect on objects that the child saw or heard (e.g., shaking a rattle). Following 
the line of fall when an object is dropped and recovering half-hidden objects. 


Stage IV (8/9 months to 11/12 months). 

At the beginning of the stage, removing an obstacle in order to grasp a toy 
which is visible: at the end, removing an obstacle when the object is hidden 
but searching only in one place. Visual examination of objects by turning them 
round and round and holding them at various distances. 


Stage V (11/12 months to approximately 18 months). 

Obtaining an object that is out of reach by pulling a string attached to it,
or the support on which it stands ; searching briefly in different places when an
object is hidden. Exploratory manipulations which vary the effect on objects, 
e.g., banging a toy successively on different-sounding surfaces. 


Stage VI (approximately 18 months onwards).
Inferring the result of an action before the object is moved, e.g., using a
stick in order to obtain a toy that is out of reach ; searching for objects for a
longer period (i.e., awareness of the permanence of objects is achieved).
More advanced manipulation was investigated by the following performance 
tasks from the Revised Stanford-Binet (Terman and Merrill, 1937) and Merrill- 
Palmer (Stutsman, 1931) scales of intelligence : 
Four-brick tower. 
Three-figure form board. 
Three-figure form board rotated. 
Wall in peg boards A and B. 
Block bridge. 
Bead threading. 
Seguin form board. 
Little pink tower. 
Six-cube pyramid. 
If a child did not succeed with any of the first four tasks, the rest were not
given.

It was subsequently found necessary to derive a performance score for each 
child. One point was awarded for each all-or-none task that was passed, accord- 
ing to Merrill-Palmer and Stanford-Binet criteria of success ; timed tasks (e.g., 
peg boards and Seguin form board) were given weighted scores on the basis of
speed. Refusals were allowed for as in the Merrill-Palmer method of scoring
(Stutsman, 1931).




The subjects were in the phase of development which, in normal children, 
would be assessed by means of developmental scales. These were used in the 
present enquiry as a basis for the assessment of speech and locomotor development,
but experience of administering such scales to severely subnormal children
suggested certain modifications. 


The locomotor section of Gesell’s schedule (Gesell, 1940 and 1941) was
used, with the exception of items for which special equipment, such as walking 
boards, was needed. Gesell’s instructions for administration were followed. 
In cases of doubt additional information was obtained from nursing staff. 
At the more advanced levels, 2½ years onwards, some items (e.g., standing on
one foot) require the child to imitate the examiner. The co-operation of with- 
drawn children could not be obtained for these items, and teaching and nursing 
staff could not supply information on them. Thus, these imitative items were 
omitted from the scoring for all children. 


The language sections of developmental scales include, in addition to 
vocalisation and comprehension items, others relating to response to sound, 
manipulation of a bell and social responses. When emotionally disturbed
children obtain low scores on language scales because they fail to respond to 
sounds, toys and people, the result does not accurately describe their language 
development. In this study only items of vocalisation and comprehension 
were included. Severely subnormal children frequently show scatter on a 
language scale, verbal comprehension being relatively more advanced than
vocalisations. Thus, these two aspects of language were treated separately.


For speech development the vocalisation items were abstracted from Gesell's 
scale, together with the age norms assigned to them by him. Scores in terms
of ‘speech ages’ were based on these norms. Records were made of the type of
vocalisations uttered by the child during the examination, and further information
on habitual sounds was obtained from nursing and teaching staff. Some
children made no other sounds except grunts and squeaks, and, although these
occur as items in Gesell's scale, speech ages were not assigned on the basis of
these alone, so that these children were omitted from the analysis of the results 
on speech. The amount of the child's spontaneous vocalisation during the 
examination was rated on a three-point scale. 


The following items were used for the investigation of comprehension : 


Response to a gestured instruction. 

Response to simple commands in a concrete situation (e.g., “Shut the
door, sit down, stand up”).

Comprehension of ‘give’ in relation to a real common object (e.g.,
cup, spoon, ball).

Comprehension of ‘show’ in relation to a real common object.

Response to the question: “Would you like a sweet?”

Response to the statement: “There's a sweet in the box.”

The simple commands item No. 6 of year II of the Terman-Merrill
scale (Form L).


Data collected on social development included the child's social response to
the examiner and to a familiar nurse, his response to difficulty, and any attempts 
at communication or at drawing the examiner's attention to his activities. 
Information on the achievement of feeding skills was obtained from nursing 
staff. 
IV.—RESULTS.
(a) Performance ability. 


The number of children falling in each of Piaget's sensori-motor stages is 
shown in Table 1, together with the average age of the children in each stage. 


TABLE 1 


SENSORI-MOTOR STAGE AND LOCOMOTOR AGE. 


Locomotor age in years 


Highest point 





Twenty-five of the thirty-four Stage VI children succeeded with two or 
more of the performance tasks, their scores ranging from 1 to 21, but no child 
below Stage V succeeded with any of them, a difference which is significant at 
beyond the 0.001 level (χ²=35.4, d.f.=1). Of the nine Stage VI children who
failed all these items, five showed signs of severe behaviour disorder in the form 
of visual avoidance, or prolonged distress. The other four were the youngest 
at this stage, aged 3 years to 4 years 4 months. 


(b) Locomotor development. 
Two locomotor ages for each child were derived from the Gesell scale as follows :


(1) The age level at which the child succeeded with all the items (basal age). 
(2) The highest point at which a success was achieved. 
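The two scores just defined can be sketched as follows. The sketch assumes that the basal age is the highest age level up to which every item at that level and at all lower levels was passed, which is the usual reading of a basal age but is not spelled out in the text; the item record below is hypothetical.

```python
def locomotor_ages(results):
    """results maps an age level (in weeks) to a list of pass/fail flags for its items.

    Returns (basal_age, highest_point); either may be None if no level qualifies.
    """
    basal_age = None
    highest_point = None
    unbroken = True  # True while every level so far has been passed in full
    for level in sorted(results):
        passed = results[level]
        if unbroken and passed and all(passed):
            basal_age = level
        else:
            unbroken = False
        if any(passed):
            highest_point = level
    return basal_age, highest_point


# Hypothetical record for one child: all items passed up to 32 weeks,
# then scattered successes up to 40 weeks.
record = {
    28: [True, True],
    32: [True, True],
    36: [True, False],
    40: [False, True],
    44: [False, False],
}

basal, highest = locomotor_ages(record)
print(basal, highest)  # a difference between the two indicates scatter
```

A child whose basal age and highest point coincide shows no scatter on the scale; a gap between them, as in this record, corresponds to the scatter analysed later in the section.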


The locomotor scores of children in each sensori-motor stage are shown in 
Table 1. From this it can be seen that the average locomotor age increases with 
successive sensori-motor stages, but there is a wide range at each stage. Nevertheless,
the difference between the means for the highest point score of Stages
V and VI is statistically significant (t=8.42, p<0.001, d.f.=61). In view of
the small numbers in Stages III and IV and of the small differences between
the means, these two stages were combined ; the resulting average is significantly
lower than the Stage V average (t=4.75, p<0.001, d.f.=47). In no instance
was a child's locomotor age (highest point) level with his chronological age. 


A difference between the basal age and the highest point occurred in thirty- 
six cases. Among the children who could not walk without support there was a 
consistent pattern in the type of item which accounted for the scatter. The 
details are given in Table 2 for the twenty-one children whose upper limit 
(highest point) was below one year. 
TABLE 2


ANALYSIS OF ITEMS PASSED AND FAILED BY TWENTY-ONE CHILDREN SHOWING SCATTER ON
THE GESELL LOCOMOTOR SCALE.


Highest point of         N.     Type of item passed (+) and     Range of scatter between
locomotor development           failed (-) at highest point     last standing item + and
                                                                last sitting item + *

[row labels and counts illegible]                               8 to 24 weeks
                                                                12 to 28 weeks
                                                                4 to 8 weeks
                                                                No standing items at
                                                                these ages


* Age level at which last success achieved for a sitting item subtracted from age level at
which last success is achieved for a standing item.


It will be noted that in nineteen cases the success at the highest point was a 
sitting item, and in one case only it was a standing item, and that a sitting item 
was failed at the highest point in only one case. Standing items, on the other 
hand, were failed at the highest point by fifteen of the sixteen children whose 
highest point was thirty-two weeks and over (there are no standing items below 
the age of thirty-two weeks). Thus, the scatter in the majority of cases was due 
to failure to show various standing responses over a range of the scale at which 
sitting responses were present. New admissions, who had recently come from
home, did not differ significantly from those who were already resident in hospital
either in locomotor score or in the relative number showing a scatter over the 
locomotor scale. 


Locomotor score (highest point) was positively correlated with performance
score among the Stage VI children (rho = 0.68, p = 0.001). Hence, this factor
was controlled in group comparisons, as was chronological age also, though a
significant correlation between age and locomotor score (highest point) within
any stage was found only for the group at Stage VI with performance scores of
0 (rho = 0.625).


In view of the tendency of Mongols to lower muscle tone (Hilliard and
Kirman, 1957), their locomotor development was compared with that of non-Mongols.
Twenty-three pairs were matched for age, Piaget stage and performance
score, the average ages of the two groups being 4.96 years and 5.025 years,
respectively. The mean locomotor age (highest point) of the Mongols, which was
1.76 years (S.D. = 0.98), differed very little from that of the non-Mongols
(M = 1.77, S.D. = 0.96). The basal age means were similarly close (1.63 years
and 1.64 years).

(c) Speech development.

In fifteen cases no sounds (apart from squeaks, crying and laughing) were
heard during the examination, and the nursing staff similarly reported having
heard no other sounds from these children. The 'speech age' of the remaining
sixty-eight children is shown in Table 3.


TABLE 3


LOCOMOTOR AND SPEECH DEVELOPMENT.


Column headings: Sensori-motor Stage; Locomotor age, highest point; Speech Age.

Surviving entries, mean (years): 0.617; 1.058; 1.660; 2.330. (Other entries not
recoverable.)


The first interesting feature of this table is the wide range in speech age at 
any Piaget stage ; the second is the increase in the upper limit of speech develop- 
ment with successive stages. At Stages III and IV the most complex sound 
made by any child was a four-syllable babble ; jargon, interwoven with single 
words, was the most advanced development at Stage V. Two Stage V children 
produced two-word combinations in an echolalic manner, but meaningful and 
relevant word combinations and short sentences occurred only in the speech of 
some of the Stage VI children. 


The difference between the means of the Stage V and Stage VI children is
statistically significant at beyond the 0.001 level (t = 3.92 for 52 d.f.). The
Stage V average is also significantly different from the combined mean of
Stages IV and III (t = 3.52, 0.01 > p > 0.001, d.f. = 33).


In view of the mutism that is sometimes associated with severe behaviour 
disorders, it was postulated that the absence of speech sounds, or very retarded 
speech development, would be related to behaviour disturbance in children 
at Stages V and VI, among whom absence of speech could scarcely be attributed 
merely to extremely slow development. Very retarded speech was defined as 
up to and including twenty-eight weeks (vowels only) for Stage V and up to and 
including thirty-six weeks (two-syllable babble) for Stage VI. Behaviour 
disorder was defined by marked visual avoidance or prolonged distress when 
approached, or mannerisms which warded off social contact. Eight of the 
eighteen children at Stages V and VI who were extremely retarded in speech 
development, or who made no sounds, showed signs of behaviour disorder, 
compared with three of the remaining forty-five children at these stages; this
difference is significant between the 0.01 and 0.001 levels (χ² = 10.25, with Yates's
correction applied, d.f. = 1).
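
The comparison above can be checked from the counts given in the text (eight of eighteen against three of forty-five). The Python sketch below applies the usual formula for chi-square with Yates's continuity correction to a 2 x 2 table; the function names are illustrative, and it is an assumption that the original calculation followed this standard form.

```python
import math


def yates_chi_square(a: int, b: int, c: int, d: int) -> float:
    """Chi-square with Yates's continuity correction for the 2x2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    numerator = n * (abs(a * d - b * c) - n / 2) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


def p_value_one_df(chi2: float) -> float:
    """Upper-tail probability of a chi-square variate with one degree of freedom."""
    return math.erfc(math.sqrt(chi2 / 2))


# Row 1: very retarded or absent speech (8 with behaviour disorder, 10 without).
# Row 2: remaining children at Stages V and VI (3 with, 42 without).
chi2 = yates_chi_square(8, 10, 3, 42)
p = p_value_one_df(chi2)
print(round(chi2, 2))    # 10.25
print(0.001 < p < 0.01)  # True
```

With these counts the statistic and its probability agree with the values reported in the text.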


Among the children at Stage VI, speech development was positively 
correlated with performance score when the four children with severe behaviour 
disorders were omitted (rho = 0.38, p = 0.05). Rank order correlations of age
and speech development within each Piaget stage and performance score group 
were not significantly different from zero in any instance. 

The bearing of the period of residence in hospital on speech development
was examined by comparing the speech development of twenty-three new
admissions and twenty-three others matched for age, Piaget stage and performance
score where applicable. The average speech ages of the two groups were
0.83 years and 0.81 years, respectively. (When new admissions were excluded,
there were too few children left at any stage for length of stay in hospital to be
correlated with speech age.)


Twenty-two pairs of Mongols and non-Mongols were matched for age,
Piaget stage and performance score. A mean difference of 0.126 years in speech
age, favouring the Mongols, was found, but there were six children with severe
behaviour disorders in the non-Mongol group and none among the Mongols.
Excluding these six children and their Mongol pairs, the mean speech ages for
the remaining sixteen pairs were 1.028 years (S.D. = 0.405) for the Mongols and
0.896 years (S.D. = 0.5) for the non-Mongols; this difference was not statistically
significant.

A comparison of speech and locomotor development in the same children
within each Piaget stage is made in Table 3, excluding the children who made no 
speech sounds. It will be seen that, for all groups, the average locomotor age is 
considerably in advance of the average speech age. 


For the whole group of sixty-eight children, the average speech age (0.92
years) was significantly lower than the average locomotor age (1.82 years) for the
highest point score (t for correlated means = 9.82, p < 0.001, d.f. = 67).


In fifty-five of the sixty-eight cases (81 per cent.), both the highest point of 
locomotor development and the basal age were higher than the speech age. In 
four cases one or both was the same as the speech age; both locomotor ages 
were lower than the speech age in four cases, and in the remaining five cases 
only the basal age was lower. 


(d) Verbal comprehension. 


None of the children classified at Stages III and IV showed any under- 
standing of speech, nor did twenty-five of those at Stage V, nine of whom 
responded to instructions given by gesture. The remaining four Stage V 
children gave evidence of verbal comprehension in simple concrete situations. 
All the children classified at Stage VI, on the other hand, showed evidence of 
some verbal comprehension. Fourteen of them responded only at the level of 
simple concrete situations ; of these nine failed the other items listed in Section 
III, but the other five were too disturbed for these to be administered. The 
remaining twenty children succeeded with three or more of the items listed in 
Section III (including concrete situations and excluding gestures). Comparing
Stages V and VI for the number of children in three categories of verbal
comprehension, (1) success with three or more items, (2) verbal comprehension in
concrete situations only, and (3) none at all, the differences are significant at
beyond the 0.001 level (χ² = 50.36, d.f. = 2).
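
The counts given in the text for Stages V and VI can be arranged as a 2 x 3 table and the Pearson chi-square recomputed, as in the short Python sketch below. The function name and the arrangement of the table are illustrative assumptions, and it is assumed that no continuity correction was applied. With these counts the statistic comes to about 50.5 on 2 degrees of freedom, close to but not identical with the reported value; the source of the small difference is not known.

```python
from typing import List


def pearson_chi_square(table: List[List[int]]) -> float:
    """Pearson chi-square statistic for an r x c table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    return chi2


# Columns: three or more items; concrete situations only; none.
observed = [
    [0, 4, 25],    # Stage V (29 children)
    [20, 14, 0],   # Stage VI (34 children)
]
chi2 = pearson_chi_square(observed)
dof = (len(observed) - 1) * (len(observed[0]) - 1)
print(round(chi2, 2), dof)  # 50.48 2
```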


The relation between verbal comprehension and Piaget's sensori-motor 
stages was further investigated by an examination of clinical records, which 
contained the Piaget stage and the results of attempts to administer the 
Terman-Merrill revision of the Binet scale. Children who succeeded with one or 
more of the year II verbal items were invariably at sensori-motor Stage VI. 


(e) Drawing.


The drawing ability of children at the various Piaget stages is shown in 
Table 4. 


TABLE 4


SENSORI-MOTOR STAGE AND DRAWING.


Column heading: Sensori-motor stage. Rows:

Faint marks or dots
Straight scribble
Circular scribble
Copy of lines
Copy of circle, partial*
Copy of circle, correct
Copy of cross

(Cell counts not recoverable.)

* By circular scribble.


It will be seen that the use of a pencil begins to occur at Stage V, but 
extends to vigorous scribbling in only two cases. All but three of the Stage VI 
children, on the other hand, scribbled freely or copied models of straight lines 
and circles. (One refused to handle the pencil.) The differences between Stages
V and VI, in three drawing categories ((1) none, (2) marks or dots, and (3)
scribbles or copies) are significant at beyond the 0.001 level (χ² = 28.1, d.f. = 2).


(f) Feeding skills. 


Feeding skills achieved by children at various Piaget stages are shown in 
Table 5. 

TABLE 5


SENSORI-MOTOR STAGE AND FEEDING SKILLS.


Column heading: Sensori-motor Stage. Rows:

Drinks if cup held
Helps to hold cup
Holds cup without help

Does not feed self with fingers
Feeds self with fingers

Spoon: Does not use
       Uses and spills
       Uses without spilling

Total in each stage

(Cell counts not recoverable.)

(g) Social responses.


All the children in the group showed awareness of the investigator and a 
nurse or teacher, most by looking at the investigator when she approached ; 
the social awareness of some was indicated in a negative way by visual avoid- 
ance. Others were distressed in the presence of the investigator, or began a 
mannerism or slapped their faces when approached. 

The response of smiling was found at all stages, though not all the children
in a given stage showed it; the incidence was naturally affected by the behaviour
described in the previous paragraph. Few children made a vocal response
when talked to by the investigator, but spontaneous vocalisations occurred in
more cases. A rating of 1 (few or no spontaneous vocalisations), compared with
a rating of 2 or 3, was given to significantly more of the children who showed
signs of behaviour disorder than to the rest (χ² = 20.02, p < 0.001, d.f. = 2).

Differences in the amount of vocalisation (assessed by the ratings) between 
new admissions and children already resident in hospital were not statistically 
significant. 

Communication, in terms of imparting information or making a demand, 
was observed mainly among the Stage VI children, twenty-two of the thirty- 
four pointing for this purpose, with or without a vocal accompaniment. Only 
two Stage V children communicated, and none below this level did so. 

Other social responses were observed only among the Stage VI children. 
Seventeen (50 per cent.) of them drew the investigator’s attention to their 
activities, or kept looking to make sure the investigator was watching their 
performance; and twelve children sought the investigator’s help when in 
difficulty. 


V.—DISCUSSION AND CONCLUSIONS. 


There are two main questions arising from these findings that call for 
discussion. One concerns the pattern of development in severely subnormal 
children, and the other is the relation of other aspects of development to Piaget’s 
sensori-motor stages. 

The group is too deviant for locomotor and speech scores to be equated in 
terms of standard scores (in any case Gesell does not provide the necessary data 
for this). The expression of the results in terms of ‘locomotor ages’ and 
‘speech ages’ is probably inexact, and consequently comparisons of rates of 
development in different aspects are approximate. Nevertheless, the difference 
between the locomotor score and the speech score of the whole group is so large 
that it would appear that the severely subnormal, although grossly retarded in 
all areas of development, tend to develop much more rapidly in the locomotor 
sphere than they do in the acquisition of speech. If the approximate ages 
suggested by Piaget for the sensori-motor stages are accepted, then at all stages 
(with the possible exception of III) locomotor development appears to be in
advance of the development of sensori-motor intelligence, which, in turn, is in 
advance of the development of speech. 

The twin study of Gesell and Thompson (1929) indicates that locomotor 
development is mainly influenced by maturation, since intensive training made 
little difference to subsequent locomotor scores. Much of Piaget’s account of 
the development of sensori-motor intelligence is a description of a learning 
process, and learning obviously enters into the acquisition of speech. Thus, it 
can be postulated from these results that the severely subnormal develop 
relatively more rapidly in the aspect which is mainly maturational, and that 
they develop relatively more slowly in aspects which depend more on learning. 

Another finding on locomotor development which calls for further investigation
is that of a large scatter over the locomotor scale by the majority of the
children who could not walk. This suggests that steps in the sequence of 
behaviour that lead to the attainment of the upright posture do not, in the 
severely subnormal, correspond with steps in the development of the sitting 
posture as in normal children. One possible psychological factor accounting for 
this is a difference in the amount of individual attention and encouragement to 
stand received by children in hospital and at home. New admissions did not, 
however, differ significantly from the children who had been resident in the 
hospital for a longer period, either in locomotor score, or in scatter over the 
locomotor scale. Although children with cerebral palsy were excluded from
the study, it is, of course, possible that there were motor lesions which were not 
observable clinically. Comparison of these data with post-mortem findings 
would be valuable. 

The limited verbal stimulation of children in hospital might also be put
forward to account for the finding of greater retardation in speech development, 
relative to locomotor development and the sensori-motor stages. The results of 
a number of studies (reviewed by McCarthy, 1954) have indicated that normal 
children living in institutions develop language more slowly than children living 
at home. A similar finding in respect of severely subnormal children has been 
reported by Lyle (1960). However, in the present study new admissions did not
differ in average speech score from children already resident in hospital. This
finding is not necessarily at variance with Lyle's, since he investigated older
children whose language was more advanced. They could identify and name 
objects, whereas the present study was mainly of pre-verbal utterances. It is 
thus possible that different types of environment become more important once 
words are beginning to be systematically acquired. The studies of normal 
children, however, give conflicting results as to the age at which an unstimulating 
environment begins to affect language development, some reporting an effect 
on phonemes in the first few months and others finding a difference only after 
the age of 2 years (McCarthy, 1954). The hypothesis that an unstimulating 
environment influences language development only after the beginning of the 
systematic use of speech might also account for the different results in respect 
of Mongols in this study and in Lyle's. Lyle (1960) found that Mongols, whether 
living in hospital or at home, were more backward in language than comparable 
non-Mongols. The Mongols of the present group did not differ significantly 
from non-Mongols. 

Further study might usefully be made of the findings of the present study 
that children with severe behaviour disorders tended to be very retarded in 
speech development or to make no sounds. This result also indicates that 
behaviour disorder is a variable that should be controlled in comparative studies 
of language development. 

The second main question to be discussed is whether children classified into 
Piaget's sensori-motor stages are clearly differentiated in other aspects of 
behaviour. For most aspects of development there is little difference between 
Stages III and IV. This is probably to be explained by the fact that all the 
children at Stage IV were only at the beginning of this stage. This appears to 
be a transitional stage, and very few of the severely subnormal children who 
have been observed have been at the end of the stage. Children are classified 
at the beginning of Stage IV if they brush aside a transparent screen in order to 
obtain a toy that it covers, though in the extent of their awareness of the perm- 
anence of objects they are the same as those at Stage III (recovering a half- 
hidden object, but not a completely hidden one). 

However, significant differences in both locomotor and speech development
were found between successive stages (III and IV together compared with V, 
and V compared with VI). Children at Stage VI were significantly differentiated 
from those at Stage V in the development of verbal comprehension. It is also 
of particular interest that it is only Stage VI children who can succeed with, or 
even comprehend, the first verbal items of the Revised Stanford-Binet scale. 
For these it is necessary to identify objects by their names and to carry out 
simple instructions about objects. This finding supports Piaget’s contention 
that the achievement of Stage VI, which involves a concept of the permanence of 
objects, is necessary for the acquisition of meaningful speech. Some children 
in the present group who were at Stage VI did not use meaningful speech, or 
identify objects by name ; whether these only lacked sufficient stimulation, or 
whether another factor is involved, is a matter for further investigation.


It was also only Stage VI children who succeeded with any performance
items drawn from pre-school scales. Thus, the attainment of Piaget’s sensori- 
motor Stage VI, at any rate in the severely subnormal, appears to be necessary 
for success with the earliest items, both verbal and non-verbal, of pre-school 
scales which are applicable to normal children from the age of 1½ to 2 years.


Results for other aspects of behaviour also underline the importance of the
attainment of the end of the sensori-motor period in the development of severely
subnormal children; only Stage VI children were able to copy lines and a
circle, and with two exceptions vigorous scribbling occurred only at this stage.
Communication began to appear at Stage V but was common only at Stage VI,
and it was only Stage VI children who sought help in difficulty, and who
indicated that they wished their activities and achievements to be noticed by an
adult.

To turn to the practical implications of the findings, the first application is in 
clinical practice. All the Stage VI children in this study showed evidence of
some verbal comprehension. Hence, if a child at Stage VI is found to have no 
verbal comprehension, then further investigation is indicated, particularly of 
the possibility of deafness. 


The results of this study also suggest when it is useful for nursing staff to 
spend time encouraging a child to use a cup and spoon. The findings indicate 
that children who have reached Stage VI can certainly be taught to feed
themselves, and they also suggest that it is worthwhile to persist with Stage V
children. However, the attempt is probably fruitless until the child reaches
one of these two stages.


In the educational field the results indicate that children who have attained 
the last sensori-motor stage would benefit from attending a training centre, 
since new developments in language, drawing and performance ability occur 
during this stage, which appears from this study to be a major event in the 
development of severely subnormal children. 


ARE RURAL CHILDREN HANDICAPPED BY THE USE
OF SPEEDED TESTS IN SELECTION PROCEDURES? 


By 
C. ANNE MORETON 
(Charlotte Mason Training College)
AND 
H. J. BUTCHER 
(Department of Education, University of Manchester) 


Summary. It is widely believed in country areas that the country child is at a
disadvantage in the working of speeded tests. This hypothesis was tested by
the comparison of two samples, each of about 300 children aged 10 to 11, one from
rural Westmorland, one from central Manchester. Power and speed scores were
obtained from N.F.E.R. tests of Arithmetic, English and Verbal Reasoning.
Almost all net differences were statistically significant in the expected direction,
tending to confirm the hypothesis, but were not sufficiently large to suggest a 
need for revision of selection procedures. 


I.—INTRODUCTION. 


In the course both of teaching in rural schools and also of supervising school
practice in a rural area, one of the authors has frequently met with complaints
from teachers and parents that the country child is at a disadvantage in the
working of speeded tests used in selection procedures. This disadvantage is 
often attributed to the slower tempo of life in the country as compared with 
that in urban areas. 

The present research is confined to testing the hypothesis that such a 
disadvantage exists, without any detailed exploration of possible reasons or 
correlates. But it is believed to be the first systematic enquiry of its kind and 
may be timely in view of recent renewed interest in the concepts of speed and 
power in mental tasks (e.g., Furneaux, 1960). 


II.—PREVIOUS RESEARCH.


The literature on mental ‘speed’ and ‘power’ is extensive, and often
conflicting. About 150 references are given by Morrison (1960) and it is, 
therefore, outside the scope of this paper to offer anything like a compre- 
hensive survey. One of the difficulties in this field of research is that in practice 
measures of both speed and power are usually adulterated or relative, since a 
pure test of speed is of trivial application, and a pure test of power impracticable. 
Furneaux (1960) is probably correct in holding that such concepts as ‘speed,’
‘power,’ ‘accuracy’ and ‘difficulty’ require a new and more operational
definition if conflicting results are to be avoided. However, certain trends— 
one might almost say fashions—in research do appear to emerge. 

In the earliest researches there was disagreement between those who maintained
that speed of reaction was positively correlated with higher mental
processes (Gilbert, 1894; Baguley, 1901) and those who found no such relationship
(Binet, 1900; Wissler, 1901; Whipple, 1904). As a result of further research
(Burt, 1909; Brown, 1910; Kruger and Spearman, 1914; May, 1921; Bernstein,
1924; Spearman, 1927), the view that such a relationship does exist
became generally accepted, although the relative importance and predictive
power of these two components of mental ability is still not entirely settled.


Since about 1930, attempts have been made, foreshadowed by Bernstein 
(1924), to interpret speed of mental work as a personality factor (e.g., Kennedy, 
1930 ; Himmelweit, 1946). In contrast, Eysenck (1953), discussing the work of 
Furneaux, has maintained that speed is the fundamental component of ability, 
and that measures of power, which depend on persistence, involve personality 
factors. 


Broadly speaking, however, the orthodox view appears still to be, firstly 
that speed and power are highly correlated, secondly that power is an important 
component, if not the more important component of general ability, thirdly, 
that tests, as used for example in the 11+ selection procedure, achieve an 
adequate compromise. Thus, Peel (1956), commenting on the kind of criticisms
that gave rise to the present investigation, concludes both that in general at
11+ “the abler child is the speedier child,” and that in any case the time-limit
imposed on most group tests in practice is not so stringent as to make speed an 
over-riding factor. 


Among psychologists as well as among the general public there are some 
who might still query this view (Porebski, 1954; Myers, 1952, 1960; Mollen- 
kopf, 1960). Myers, for example, writes : “ In timed selection and achievement 
tests, we may be defining the purposes of education as including, perhaps 
emphasising, the teaching of the value of snap judgment and educated guessing, 
over the value of deliberate careful thought.” 


In view of the continuing interest in these questions, it is mildly surprising 
to find how little research has been done into the social correlates of power and 
speed. Klineberg (1928), Lambeth and Lanier (1933) and Knapp (1960) 
investigated racial differences, but urban-rural differences appear to be relatively 
unexplored. The most relevant study known to the writers is that of Smith 
(1948), whose conclusions about Hebridean children include the following : 


“ It is clear in these circumstances that the time factor has a profound 
effect on test results over and above the normal effect. A rigidly timed test 
does not seem to discriminate very well in the range of the intelligence scale, 
because mental processes which are more slowly initiated and pursued than 
is usual are constantly being interrupted and suspended by the imposed speed 
of the examiner . . . There is little evidence of the modern tendency to race with 
time. It is rightly considered infinite and unlikely to interfere with the expected 
sequence of things ; so in the normal course of events there is no need for undue 
haste. Probably if the children had grasped more fully the significance of the 
stop-watch in the experiment, the scores would have been more flattering to 
them.” 


Smith’s research was primarily concerned with bilingual children, and 
there can be few rural areas comparable in isolation with the Outer Hebrides, 
yet it seems reasonable to postulate a gradation of natural tempo between such 
children and urban children. The aim of the present research is to determine 
whether English rural children in a relatively isolated area are handicapped to 
some degree in a similar way.


III.—CHOICE OF SAMPLES.


(a) The Manchester sample. 

The Manchester sample of 297 was drawn from six schools in a lower 
working-class area near the centre of the city. As will be seen from Table 1, 
the largest number from any one school was 74 and the smallest number 36. 
In the total sample there were 153 boys and 144 girls. The average age was 
10:11. 


TABLE 1

(Table entries not recoverable.)

School No. 1 was a junior mixed school on the edge of a rapidly deteriorating
area and, according to the headmistress, has over the past five years lost a 
number of good working-class families who have moved out to one of the newer 
estates. Two coloured boys were concerned in the tests, a half-caste and a 
Nigerian; many coloured families are now invading this part of Manchester. 
There were several ' problem ' families, but on the other hand, still many 
families who had lived in the district for one or two generations. 


School No. 2 was a junior and infant school. Again, there was some
evidence of deterioration, both to property and living standards in this area, 
although on the whole, the school had maintained its standards. There were 
several coloured children in the school who seemed to be at least of average 
ability and who also seemed to fit in very well with the community. 


School No. 3 was a junior school. It was in a similar type of area to that of 
School No. 2, although here there was more evidence of deterioration of 
property and living standards. The coloured children did not appear to be of 
the same calibre as those in the previous school. 


School No. 4 was a junior school. This school was in rather a better resi- 
dential working-class area but the population of the school was a little mixed 
and there was some slight evidence of a deterioration in the background and 
living standards of the children over the past few years, although actually the 
proportion of those selected for grammar schools was above average. 


School No. 5 was what is generally known as a slum school. There is a
large floating population and many families live in rooms and sub-standard 
houses. There are about thirty coloured children in the school. It is interesting 
to note that at this school a coloured boy who took no extra time for any of the 
tests had both the highest speed raw score and quotient and also the highest 
time-limit raw score and quotient of the Manchester sample in Arithmetic.


School No. 6 was the nearest to the centre of the city. Some of the children
were from very good working-class homes with interested parents, others were 
from poor homes both economically and socially. There was a small proportion 
of coloured children in the school and also a variety of nationalities (e.g., 
Spaniards, Nigerians and Indians) with attendant language difficulties. Con- 
sidering the prevailing living conditions, the number going forward to either 
grammar or technical schools or grammar-stream in secondary modern schools 
was relatively high. 


(b) The Westmorland Sample. 


The Westmorland sample of 274 was drawn from all the rural schools in the 
county except two which had to be withdrawn owing to administrative difficult- 
ies. The largest number from any one school was twelve and the smallest 
number one. There were 142 boys and 132 girls in this sample, with an average 
age of 10-8. The total number of schools involved was fifty-eight, the largest 
number on roll being ninety-six, and the smallest six. There were twenty-five 
one-teacher schools, twenty-seven two-teacher schools, five three-teacher 
schools and one four-teacher school. It is probably true to say that the 
majority of the children in these schools were the children of agricultural 
workers, although there is a slight but steady trend towards an increasing 
proportion of children whose fathers work in the towns and travel out each day. 
The majority of such parents are office workers and/or members of the pro- 
fessions. 


Considerable care was taken to confine the sample to genuinely rural areas, 
by the omission of even small towns and of areas that are fast becoming 
urbanised. The number of old Westmorland family names gives support to the 
assumption of a strong ‘native’ element. There are, for example, thirteen of
these known to the writers, each of which occurs on average four times. It is
not claimed, however, that the two samples are equated for socio-economic 
status, and it is probable that the Westmorland sample contained a higher 
proportion of children from middle-class homes. 


IV.—METHOD.
(a) The tests used and their administration. 


The tests used were the NFER Arithmetic Test 8, English Test 8, and 
Verbal Reasoning Test 8a. Considerable practical difficulties were envisaged in 
conducting a programme of testing over a wide rural area with poor communica- 
tions and a high proportion of one-teacher schools, in which such a programme 
can easily disrupt the whole school. 


The solution adopted was that the Westmorland results should be based on 
the Authority’s preliminary testing, with some additional instructions to 
supervisors, and that the same tests should be administered to the Manchester 
children under as closely similar conditions as possible (but without, of course, 
forming part of the Manchester Authority's official programme of testing). 


Such a solution is not ideal, and might even introduce differences in absolute 
level of performance, but it was considered unlikely seriously to disturb the 
relative differences between speed and power which form the main topic of 
investigation. 




Instructions to Supervisors. 
In addition to the usual instructions for administration issued with the 
tests as to strict time limit, supervision and marking, the following instructions 
were sent to each school : 


Additional Instructions to Supervisors
Arithmetic Test 8 (50 Minutes), English Test 8 (50 Minutes)
Verbal Reasoning Test 8a (45 Minutes)

(1) After each test has been in progress for 10 minutes, instruct the testees to
place an ‘X’ in the margin.

(2) Record time taken by each testee to complete test within the official time-
limit. This to be recorded on list containing names and ages (in years and
months).

(3) At the earliest convenient time (possibly the following day) allow testees
to have back their tests if they are not complete, and to complete them in
red pencil. (N.B.—All items omitted on the previous occasion must be
completed.)


(b) Types of Score desired. 

The ten-minute score is referred to as a Time-Limit Score, following
Davidson and Carroll’s (1945) definition as “the number of items correctly
answered within a prescribed time-limit.” The score achieved on the normal
working of the test, i.e., on the first day and within the ‘official’ time, is referred
to as the Speed Score. The distinction between ‘time-limit score’ and ‘speed
score’ is thus one of degree rather than of kind. Finally, the total score, including
any red pencil entries, is called the Power Score. Thus, the experiment was
designed to give nine scores in all for each child, consisting of three types of 
score on each of three tests. 


(c) Method of standardisation. 

In order to test our hypothesis, it was necessary to make power scores, 
time-limit scores and speed scores all comparable. Conversion tables 
incorporating an age allowance and giving quotients were available in the case 
of speed scores, but not, of course, in the case of the other two types of score, 
which were peculiar to the experiment. 

The power and time-limit scores were, therefore, scaled to the same mean
and standard deviation as the speed scores for the combined sample. It was
then possible to convert these into quotients comparable with the speed
quotients. A section of one of the conversion tables used is reproduced in
Table 2.
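
The scaling step can be illustrated with a short sketch. It assumes a simple linear transformation, which is one reading of "scaled to the same mean and standard deviation"; the age allowance built into the conversion tables is left out. The function name and all figures are invented for illustration and are not the authors' data.

```python
from statistics import mean, pstdev

def rescale(scores, target_mean, target_sd):
    """Linearly rescale scores so that they have the target mean and SD."""
    m = mean(scores)
    s = pstdev(scores)
    if s == 0:
        raise ValueError("scores have zero spread and cannot be rescaled")
    return [target_mean + target_sd * (x - m) / s for x in scores]

# Hypothetical raw power scores and speed quotients (not the authors' data).
raw_power = [42, 55, 61, 48, 70]
speed_quotients = [96, 104, 110, 99, 118]

# After rescaling, the power scores share the mean and SD of the speed quotients.
scaled_power = rescale(raw_power, mean(speed_quotients), pstdev(speed_quotients))
```

A linear rescaling of this kind preserves each child's rank order within a score type, so it changes only the units in which power and time-limit scores are expressed.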


TABLE 2


SECTION OF CONVERSION TABLE (ARITHMETIC TEST)
(Ages 10 years—10-11 years)


Age in years and completed months at time of test: 10-6 | 10-7 | 10-8
[Body of table not recoverable.]
V.—RESULTS.


(a) A direct comparison of the two samples on speed, power and time-limit scores. 

Table 3 summarises the comparative performance of the two groups on 
the three tests and under the three conditions of work. The power and time- 
limit scores have been made comparable with the speed scores by the procedure 
already described, which also incorporates an age allowance : 


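TABLE 3
MEANS AND STANDARD DEVIATIONS—CONVERTED SCORES.


[Row and column headings not recoverable. Each surviving row gives two pairs of figures, apparently a mean and standard deviation for each sample.]

103.94 13.47 | 101.82 11.14
103.36 13.11 | 102.18 12.04
104.16 13.19 | 101.87 12.02

104.44 12.40 | 102.16 11.78
104.07 12.44 | 102.58 12.12
104.34 12.25 | 100.65 13.17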





It can be seen from Table 3 that any significant differences are in favour
of the Manchester group. Furthermore, all these differences are in respect of the
speeded conditions of work (i.e., ‘speed’ and ‘time-limit’). Thirdly, in each
test, the more highly speeded the conditions the greater or more significant 
the difference in favour of the urban children. This is particularly noticeable 
in the case of arithmetic, but applies in some degree to the other two tests as 
well. 


(b) A comparison of the two samples on difference scores (e.g., power minus speed).


The results reported in the last section already suggested that the urban
children were superior to the rural children to a markedly greater extent in
the speeded than in the unspeeded conditions, although absolute differences
were small. But a crucial test of the main hypothesis depends on a study of
second-order differences, i.e., a comparison of the urban and rural children on
power-score minus speed-score.
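
As a rough illustration of the second-order comparison, the sketch below forms a power-minus-speed difference for each child and then computes a critical ratio between the two samples. It assumes the critical ratio is the difference between the two means divided by the standard error of that difference, which was the usual meaning of the term; the quotients are invented and the function name is hypothetical.

```python
from math import sqrt
from statistics import mean, variance

def critical_ratio(group_a, group_b):
    """Difference between two means divided by its standard error."""
    se = sqrt(variance(group_a) / len(group_a) + variance(group_b) / len(group_b))
    return (mean(group_a) - mean(group_b)) / se

# Hypothetical (power quotient, speed quotient) pairs, one pair per child.
rural = [(104, 100), (98, 97), (110, 105), (101, 99)]
urban = [(103, 103), (99, 100), (108, 107), (100, 99)]

# First-order step: one difference score per child.
rural_diff = [power - speed for power, speed in rural]
urban_diff = [power - speed for power, speed in urban]

# Second-order step: compare the two sets of difference scores.
cr = critical_ratio(rural_diff, urban_diff)
```

A positive ratio here means the rural children gain more than the urban children when the time pressure is removed, which is the direction the hypothesis predicts.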


Accordingly, the difference scores were tabulated and analysed, with 
results as shown in Table 4 below. 


TABLE 4


COMPARISON OF WESTMORLAND AND MANCHESTER CHILDREN ON POWER SCORE MINUS
SPEED SCORE.


Manchester | Westmorland | Difference | Critical Ratio | Significance Level
[Row labels and the first three columns not recoverable.]
           |             |            | 2.17           | .05
           |             |            | 3.80           | .01
           |             |            | 2.89           | .01





It can be seen from Table 4 above that all three crucial differences, although 
small, are in the expected direction and are statistically significant. The size
of the difference is largest for English and smallest for arithmetic, although the 
range is not very great, and should in any case not be interpreted too strictly, 
since the standardising procedure in the experiment ensured comparability of 
speed and power scores for any one test, but not necessarily between tests. 

A similar comparison in respect of power scores minus time-limit scores 
is shown below in Table 5.


TABLE 5


COMPARISON OF WESTMORLAND AND MANCHESTER CHILDREN ON POWER SCORE MINUS
TIME-LIMIT SCORE


Manchester | Westmorland | Difference
[Body of table not recoverable.]








The second-order differences shown in Table 5 are again all in the hypothesised 
direction, two being highly significant, and the third close to statistical
significance. Since the ‘time-limit’ scores are formally similar to ‘speed’ scores,
but based on a shorter period of testing time, it might have been supposed that
less conclusive results would be obtained than in the comparison between power
scores and speed scores. The indication is, however, that even within the short
space of ten minutes the measurement of speeded work operates to some extent 
in favour of urban children. 
VII.—CONCLUSIONS.


1.— Children from a working-class area of central Manchester were compared 
with children from a rural area of Westmorland on moderately speeded tests of 
arithmetic, English and verbal reasoning as used in an 11+ selection procedure. 
Manchester scores were slightly higher on the average in all three papers, two 
of the three differences being statistically significant. 


2.—Scores were obtained after 10 minutes’ work on the forty-five minute 
or fifty-minute tests. Rather larger differences were found in favour of the
Manchester children. 


3.—When the children were allowed unlimited time to complete the papers, 
the mean differences between the groups were reduced, and none was significant. 


4.—Power-minus-speed scores were calculated for each child. Westmorland
children were slightly but significantly higher on all three papers. 

5.—The hypothesis that speeded tests to some degree handicap rural 
children was considered to be confirmed. Relative differences, however, being 
of the order of one unit of I.Q., were so small as not to suggest any revision of 
the selection procedure. 
SOME CHARACTERISTICS OF CHILDREN WITH TWO
TYPES OF SPEECH DISORDER 


By 
(St. John's Hospital, Aylesbury)
(Stoke Mandeville Hospital, Aylesbury)


Summary. An investigation was made of twenty successive cases of develop- 
mental speech defects. They were divided according to the type of speech 
disorder (Greene 1960) into a ‘Motor’ group (eight children) and a ‘Language’
group (twelve children). In the Motor group, but not the Language, there was 
evidence of delay in general motor maturation. The two groups were compared 
on measures of mechanical reading, consistency of laterality and W.I.S.C. 
subtests. Both groups were very poor at reading, but whereas all the Motor 
group had inconsistent laterality, only a third (i.e., a normal proportion) of the 
Language were inconsistent ; on the other hand, the Motor group had a much 
better knowledge of right and left. The Language group, but not the Motor, 
had poor scores on the Arithmetic and Digit Span subtests. Also the Language 
group was significantly worse than the Motor on Block Designs. 

It was suggested that these might represent two distinct groups of develop-
mental language defects, and that recognition of the two might help to explain 
some of the contradictions in the literature. 


I.—CLINICAL BACKGROUND. 


This is a preliminary attempt to measure by psychological tests certain
clinically observed features of children suffering from dyslalia and referred
to the speech therapist between the ages of 3-6 years. It was possible for the
children to be sorted into two groups on the basis of speech performance alone
as already described by Greene (1960) and classified as suffering from a ‘motor’
or ‘language’ disorder. The speech of the youngest children at first exhibited
characteristics of both motor and language disorder. This was to be expected
since the normal child has the same difficulties in handling articulation and
language when learning to speak, though to a lesser degree. By the age of 4:0-
4:6, the pattern of a predominantly motor disorder on the one hand and a
language disorder on the other had emerged. Diagnosis in each case was 
supported by a careful case history of general and speech development and the 
tests for motor and language maturation detailed below. 


Motor Disorder Group. 


The children classified as suffering from motor disorder had great difficulty
with articulation and the accurate imitation of consonants and vowels,
especially complex consonant blends such as sp, sk, str, etc. The omission and
substitution of consonants followed a pattern which reduced the conventional
phonetic sequences of the language to simpler forms, for example ‘biscuit’
pronounced ‘bitit,’ ‘swim’ as ‘fim.’ Diphthongs were frequently reduced to
vowels as when ‘boy’ was pronounced ‘bore.’ These mispronunciations were
consistent as when ‘sk’ was invariably substituted by ‘t’ and ‘sw’ by ‘f.’
Once the child's individual code was known, it was possible to understand what
he said. Vocabulary and language development were normal for age and the
Watts picture tests presented no difficulty. These children did not have
difficulty with auditory discrimination and distinguished readily between right
and wrong pronunciation when spoken by the therapist. Auditory memory was
normal for age when tested with nonsense syllables and the standard sentences
from the Terman-Merrill Intelligence Scale. Drawing was good and they had
no constructional or spatial difficulty.


The developmental history and clinical testing of motor maturation
supported the diagnosis of delayed motor maturation. Walking was late,
the mean age of walking of the group used in the investigation (see Method
following) being 16.6 months (range 14-23), whereas the mean of the language
group was 14.4 months (range 10-18).


Accurate dating of commencement of speech was not possible in every case, 
since the mother, whose report was the sole source of information, was not able 
to determine when pre-linguistic babbling merged into speech ; some mothers in 
both groups were sure their child had never babbled in the first year. 


Clinical tests of motor maturation included closing fist and thumb, thumb 
and finger opposition, and cutting paper with scissors (Merrill Palmer) ; hopping 
on one foot (Gesell) and ability to protrude and lift tongue-tip (Morley 1957), 
besides ability to imitate articulatory movements in consonant and vowel 
syllables. All children exhibited signs of late motor maturation and poor co- 
ordination which appeared to stem from a difficulty in isolating movements, 
with delay in the normal synkinetic evolution as described by de Ajuriaguerra 
and Stambak (1955). 


The Language Disorder Group. 


The language disorder group did not suffer from a similar delay in motor
maturation; they crawled, sat and walked within normal limits and performed
well on co-ordination tests. They had little or no difficulty in direct imitation
of consonant blends and words but had considerable difficulty in recalling the
correct phonetic sequences and also grammatical constructions in spontaneous
speech. Their speech was characterised by inconsistent substitutions and
omissions which rendered speech often incomprehensible even to the mother.
They made unpredictable mistakes in the order of sounds within words and
words within sentences. Conjugation of verbs was poor and they performed
badly on the Watts language picture tests. They had difficulty in remembering
names of things and people. They could discriminate by ear perfectly when
pictures of similar sounding articles were named and when the correct
pronunciation was contrasted with the child's error. They could not determine
whether a word was right or wrong, if presented with their own erroneous
version, as in the case of the child who said ‘umblella’ or ‘lumbella’ for
umbrella.


Auditory memory as tested by repetition of groups of nonsense syllables 
and the standard sentences from the Terman-Merrill Intelligence Scale was 
deficient. Some of the children had a constructional or spatial difficulty and 
failed the Merrill-Palmer tests for copying a cross, a star and the three and six- 
cube pyramids and assembly of the Manikin. Drawing showed an inability to 
appreciate form and orientation. For these reasons it was felt that these 
children had a deficient sense of order in time and space associated with the 
difficulty in retention and recall of auditory material. 
II.—METHOD.


The subjects were children with severe speech defects referred for treatment
to the speech therapist (M.C.L.G.). None of them had any physical malformation;
clinical and audiometric examinations by the appropriate authorities
had revealed no neurological or auditory abnormality. Those who fell into one
of the two categories described above were referred to the Psychologist (T.G.C.),
who carried out the tests. The speech therapist made the differential diagnosis
between motor and language before seeing the test results, but the diagnosis
was not known to the psychologist until the whole series had been tested.
The children had to be at least five years old when the testing was carried out,
because this is the minimum age for one of the tests used (the W.I.S.C.);
also they were not sent until their speech was at least intelligible to a person
unfamiliar with them. All appropriate cases were referred for testing until a
total of 20 was reached (8 motor (7M, 1F) and 12 language (8M, 4F)). This took
nearly two years. In the testing session, the following investigations were
carried out:


W.I.S.C. The I.Q. was calculated from seven subtests: Vocabulary,
Similarities, Arithmetic, Digit Span, Picture Completion, Block Designs and
Coding. Any child with a full I.Q. below 85 was excluded, to avoid cases where
speech disability was due to inadequate general intelligence. The subtests were
used for the investigation of immediate memory and ordering ability. The
advantage of having standardized scores was thought to outweigh the
disadvantage of using tests which were not specially devised ‘pure’ measures of
the abilities concerned. From the way in which the scores are standardized,
a child's Scaled Score on any subtest can be compared directly with his scores
on other subtests, and also with the scores of other children of different ages.
Digit Span was used as a measure of immediate memory, and Arithmetic and Block
Designs as measures of ordering ability: Arithmetic, at this level, being a measure
of the child's familiarity with the sequence of numbers and the ability to apply
it to simple addition, and Block Designs measuring the ability to copy groups of
objects in their correct spatial relationships. They were compared with
Vocabulary and Similarities, which were considered the most free from
‘ordering.’ The other two subtests were omitted, as not falling definitely into either
category. It was predicted that the Language group would be relatively poor
on the Digit Span, Arithmetic and Block Designs, compared with their
Vocabulary and Similarities.


Reading, etc. In Hallgren’s (1950) large study of reading disability, the 
only factor of those studied which showed a highly significant relationship to 
dyslexia was the presence of speech defects. The reading of our cases was 
examined to see if it was poor, and to what extent. No hypothesis was made 
about the difference between the two groups. A mechanical reading test was 
used—Schonell’s Graded Word Reading Test; this was thought preferable 
to a comprehension test, as being less dependent on intelligence. The reading 
age obtained from this was converted to a quotient in the usual way, so that it
could be compared approximately with the I.Q. As it was not thought appropriate
to test a child's reading before the age of 6, any child who was not yet 6
on the first testing was seen later, just for the reading, when he was at least 6.
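
The conversion "in the usual way" is not spelled out in the text. The sketch below assumes the conventional ratio quotient, reading age divided by chronological age and multiplied by 100; the function name and the example child are hypothetical.

```python
def quotient(test_age_months, chronological_age_months):
    """Attainment age expressed as a percentage of chronological age."""
    if chronological_age_months <= 0:
        raise ValueError("chronological age must be positive")
    return 100.0 * test_age_months / chronological_age_months

# Hypothetical child: reading age 6 years 0 months, actual age 8 years 0 months.
rq = quotient(72, 96)

# I.Q. minus reading quotient, for an assumed I.Q. of 110.
discrepancy = 110 - rq
```

Expressing reading attainment on the same scale as the I.Q. is what allows the two to be subtracted directly, although the comparison remains approximate.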


Each child was asked to write his name and two simple sentences, ‘a big dog’
and ‘I like apples,’ chosen to bring out reversals or transpositions of letters
(what Orton (1937) calls ‘static’ and ‘kinetic’ reversals) since these are
important to theories which link speech and reading disabilities with peculiarities
of laterality. This was scored simply for the presence or absence of reversals of
either kind. The letters were dictated to children who could not spell words.

Laterality. The history of the study of the relationship between speech and
reading difficulties and various types of laterality (left handedness, mixed or
crossed laterality, etc.), is a long and contradictory one, well summarized by
Vernon (1957). No prediction was made as to differences in laterality between
the two groups. The assessment was made by a simple group of observations
of the ‘choice’ type: hand used for writing, hand for throwing (five trials),
hand for catching, when instructed to use one hand (three trials), eye for
counting fingers looking through a small hole in a piece of paper held in two
hands (three trials, in three directions), eye used in naming pictures through a
‘V-scope’ (Miles’ ABC Vision Test) (five trials), and foot for kicking (three
trials). Clark (1957) found kicking the best and most consistent test of footedness;
a matchbox was used instead of a ball to facilitate central placement.
There are twenty items in all, and they were scored just as ‘consistent’ or
‘inconsistent.’ The criterion for ‘consistent’ was nineteen or twenty items the
same side, unless the one different was the hand used for writing; anything less
than nineteen was ‘inconsistent,’ though as it turned out the highest score in the
‘inconsistent’ group was seventeen, and the ‘consistent’ all scored twenty.
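
The scoring rule just described can be written out as a small sketch. The task labels, the data layout and the function name are invented for illustration; only the trial counts and the criterion itself come from the text.

```python
# Trials per observation, as listed in the text (twenty items in all).
TRIALS = {"writing": 1, "throwing": 5, "catching": 3,
          "eye_fingers": 3, "eye_vscope": 5, "kicking": 3}

def laterality(items):
    """Classify a list of (task, side) observations, side being 'R' or 'L'."""
    sides = [side for _, side in items]
    majority = max("RL", key=sides.count)
    odd = [task for task, side in items if side != majority]
    if len(odd) == 0:
        return "consistent"
    # Nineteen alike still counts, unless the odd item is the writing hand.
    if len(odd) == 1 and odd[0] != "writing":
        return "consistent"
    return "inconsistent"

def all_right():
    """A hypothetical child who used the right side on every item."""
    return [(task, "R") for task, n in TRIALS.items() for _ in range(n)]

child_a = all_right()                             # twenty right-sided items
child_b = [("writing", "L")] + all_right()[1:]    # only the writing hand differs
child_c = all_right()[:-1] + [("kicking", "L")]   # one kick differs
```

Under this reading of the criterion, child_a and child_c are scored consistent and child_b inconsistent, since a discrepant writing hand is treated as decisive even when the other nineteen items agree.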

The children were also asked to indicate their left foot, right ear, etc.
(five trials) and to perform crossed movements (e.g., “touch your right ear with
your left hand”) (three trials). These two sections were simply scored as
‘correct,’ if all trials were correct, or ‘incorrect.’


III.—RESULTS.
Intelligence and Analysis of Subtests. 


TABLE 1


INTELLIGENCE AND SUBTEST SCORES OF THE TWO GROUPS.
[Body of table not recoverable.]





The two groups differ significantly in the expected direction on the tests
where a difference was predicted, while differences in Vocabulary and Similarities
are not significant. Unfortunately, the Motor group have a considerably
higher I.Q. than the Language group, so that they would be expected to be
significantly better on some subtests. To eliminate the effect of I.Q., scores on
each of the three critical subtests were subtracted from the mean of Vocabulary
and Similarities for each individual. If the two groups are compared with
respect to these differences, we have an estimate of their relative efficiency on
the critical tasks, as compared with the neutral ones, I.Q. being held constant.
Also, from the way in which the scaled scores are derived, we know that random
groups of the population would be expected to have a mean difference of nought
when any two subtests are compared; so by examining the approximation of
our group mean differences to nought, we can estimate whether they differ from
the general population in their relative skill on these tests, I.Q. again being kept
constant. Table 2 shows the result of these two operations.
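
The subtraction described above can be sketched for a single child as follows. The scaled scores are invented and the function name is hypothetical; a positive result means the child did worse on the critical subtest than on the two neutral ones.

```python
from statistics import mean

def ordering_deficits(scaled):
    """Neutral mean (Vocabulary, Similarities) minus each critical subtest."""
    neutral = mean([scaled["Vocabulary"], scaled["Similarities"]])
    critical = ("Digit Span", "Arithmetic", "Block Designs")
    return {name: neutral - scaled[name] for name in critical}

# Hypothetical scaled scores for one child (not taken from the study).
child = {"Vocabulary": 12, "Similarities": 10,
         "Digit Span": 7, "Arithmetic": 9, "Block Designs": 11}

deficits = ordering_deficits(child)
```

Because each child serves as his own baseline, a group's mean deficit can be compared both with the other group and with the value of nought expected in the general population.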


It will be seen from Table 2 that the means of the Language group all 
deviate from nought in the predicted direction, but only one is significant, 
Digit Span. In comparing the two groups all differences are in the expected 
direction and two are significant at the 5 per cent. level. If a one-tail test is 
used (as is reasonable since the results were predicted) all three are significant 
at the 5 per cent. level. Thus, the Language group are poorer than the Motor 
group on the three tests predicted, but compared with the general population 
on only one test, Digit Span, is their low scoring significant. 

Reading. Table 3 shows the mechanical reading quotients of the two groups
compared with their I.Qs. One child was lost from each group; these both did
the original testing at the age of 5, and could not be obtained for a second
assessment after their sixth birthday. Neither of these could read any words,
and they only named about half of the letters in the first line of the test correctly
(4 and 6 respectively out of 10, I.Qs. 105 and 116, ages 5 years 4 months and
5 years 6 months), so their reading was probably retarded. The Table shows
that both groups were considerably backward at mechanical reading, and both
to about the same extent. Four out of seven of the Motor group and five out of
eleven of the Language had reading quotients more than 30 points below the
I.Q. Comparison of the (I.Q.—R.Q.) means of the two groups gives t=.50,
P>.6.


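TABLE 3


MECHANICAL READING QUOTIENTS.


[Row labels lost in reproduction; those given here are reconstructed from the text.]

                 | Motor Group (N=7)        | Language Group (N=11)
Age              | M=7.44, Range=6.3—11.1   | M=7.64, Range=6.1—9.8
I.Q.             | M=115.00, Range=105—131  | M=99.82, Range=88—115
Reading Quotient | M=86.14, Range=72—107    | M=73.64, Range=61—87
I.Q.—R.Q.        | M=+28.86, Range=+13—+38  | M=+26.18, Range=+8—+45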





Laterality, Knowledge of Sides and Reversals. 

All eight of the Motor Group were ‘inconsistent’ on the laterality tests,
but only four out of twelve of the Language Group. The probability of getting
such an extreme division as this, by direct calculation (two-tailed) is .008. There
is no obvious pattern in the type of inconsistency shown by the Motor Group.
Putting the predominant choice for each member in the order hand, eye, foot,
there are three RLR, two LLR, one RRL, one RLL and one LRL. Four of them
also had inconsistencies within one member. In the Language Group there
were eight RRR, one RRL, two RLR and one LRR. From the data given by
Harris (1949) the proportion of inconsistent in the Language Group is about
what would be expected in an unselected sample of children of this age group.
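
The "direct calculation" can be reproduced with a short sketch. It takes the counts from the paragraph above (all eight Motor children inconsistent, and eight of the twelve Language children RRR, leaving four inconsistent) and assumes that the two-tailed figure was obtained by doubling the one-tailed probability; that assumption is ours, although it does reproduce the reported value. The function name is hypothetical.

```python
from math import comb

def prob_all_from_one_kind(n_kind, n_other, n_drawn):
    """Chance that n_drawn children picked at random are all of one kind."""
    return comb(n_kind, n_drawn) / comb(n_kind + n_other, n_drawn)

# 12 inconsistent and 8 consistent children in all; the Motor Group has 8 members.
one_tailed = prob_all_from_one_kind(12, 8, 8)

# Doubling is an assumption about how the authors arrived at a two-tailed value.
two_tailed = 2 * one_tailed
```

With these counts the one-tailed probability is 495/125970, and the doubled value rounds to .008.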


From Table 1, it can be seen that the Motor Group are older and more 
intelligent than the Language Group, so this lack of establishment of laterality 
cannot be ascribed to lesser general maturity. 


The ability to name one's sides is probably an all-or-none ability. Accord-
ing to Launay (1952a), nearly all dyslexic children can do it by the age of 8
and nearly all normal children by the age of 7. Thus, it is unlikely to be dis- 
criminating after the age of 8, and we restricted this comparison to children 
between the ages of 5 years 0 months and 7 years 11 months. This left seven of 
the Motor and nine of the Language. Five Motor and only two Language were 
correct in simple indication of sides, and all seven were of inconsistent laterality. 
The nine who made mistakes were three inconsistent and six consistent. The 
probability of drawing all seven inconsistent from a group of ten inconsistent 
and six consistent (two-tailed) is .021. Unfortunately, knowledge of sides in
this age-group is clearly related to intelligence. The Mean I.Q. of the ‘correct’
group is 112.3, of the ‘incorrect’ group 98.8. However, there is certainly no
evidence of a relationship between inconsistency of laterality and lack of 
knowledge of sides, and in any case the ‘incorrect’ group is mostly drawn from 
the Language group whose I.Q. is thought to be lowered by their special dis- 
abilities. There were four 7-year-olds in the Language group, and they all 
made mistakes; so they are probably worse at this than normal; the three 
7-year-olds in the motor group were all correct. Only three of the whole series 
were able to carry out crossed movements, two motor and one language, all 
three of inconsistent laterality.


In considering the question of reversals of writing, the second test of the 
original 5-year-olds was used. As two were missed, this gives six Motor and 
eight Language between the ages of 6 years 0 months and 7 years 11 months. 
Four of the Motor group and six of the Language showed reversals, five out of 
six with consistent laterality and five out of eight inconsistent. There is no 
relationship here between reversals and type of speech disorder or type of 
laterality. The degree of reversal varied greatly from just one error (‘b’ for
‘d’), to a complete confusion of all letters. This clearly needs a more systematic
method of study.


IV.—DISCUSSION.


The prediction of the relatively poor performance of the Language group 
on Digit Span, Arithmetic and Block Designs was borne out in the case of 
Digit Span and Arithmetic. The Motor group were significantly better than 
the Language group on the Block Designs, but this was due to the fact that the 
Motor group were better on these than on other tests, not that the Language 
group were worse. Digit Span, the most discriminating test, is a test of
‘immediate auditory memory,’ but in memory tests it is always advisable to
ask ‘memory for what?’, and in this case it is memory for numbers in order;
mistakes are often mistakes of order, the correct digits being given but in the 
wrong sequence. So this can, perhaps, also be taken as evidence for the poor 
ordering ability of the Language group. 


The most interesting finding is the difference between the two groups in
consistency of laterality, the Motor group being all inconsistent, the Language
group containing a ‘normal’ number of children with inconsistent laterality.
This seems to provide a further definite basis for distinguishing the two types,
in addition to the speech differences. It may offer a lead to the solution of the
problem of the contradictory findings about laterality in reading disabilities.


We have two groups of children, who can be characterized in the following
way: Group 1—Speech disability of the ‘motor’ type, poor reading, inconsistent
laterality, no deficiency of immediate auditory memory or ordering
ability; Group 2—Speech disability of the ‘language’ type, poor reading,
‘normal’ laterality, poor immediate auditory memory and poor ordering
ability. In addition, knowledge of sides does not appear to be related to laterality,
and tendency to make reversals in writing is not related to inconsistent
laterality or to poor ordering ability as assessed by other tests; on these two
points, however, the evidence is rather flimsy. In absolute terms, Group 2
are considerably worse readers than Group 1. Not one of them reaches a
quotient of 90. The two groups show the same discrepancy between reading
and I.Q., but it must be remembered that the I.Q. is partly composed of tests
in which Group 2 are thought to do badly because of their language disability.


In spite of the negative evidence, there are so many studies showing a
relationship between poor reading and peculiarities of laterality that there may
be some truth in it. The easiest way of explaining the discrepancy in the
findings would be to suppose the existence of two or more types of disability,
and our Motor group appears to be one quite well-defined type, corresponding
roughly to what Vernon (1957) calls “a class of individuals who are generally
lacking in maturation” (page 109), though we should perhaps qualify it as
‘motor maturation,’ since they are of good intelligence, with little variation in
abilities. The other group is not so well defined, the nature of their deficiency
in ‘ordering’ ability not having been demonstrated in a completely satisfactory
way. Block Designs failed to bring out clearly the clinically observed spatial 
difficulties of the Language Group, although they were, in fact, significantly 
worse on them than the Motor group. This may be due to the fact that it is a 
copying test. It is well known that children with severe difficulty in writing 
letters from memory can often copy them quite well. Perhaps a creative rather 
than a copying type of test would be more discriminating ; de Hirsch (1957) 
considers a draw-a-man test to be useful for this purpose. It is hoped to 
investigate this further with more refined procedures. 


Much of the study of developmental language defects in children has been
concerned with the association of three things: (a) the actual disability in
speech, reading and writing; (b) peculiarities of laterality; (c) defects in spatial
and temporal ordering. There are a priori reasons for linking (b) and (c) but
it seems possible that it is the attempt to do so which accounts for some of the
confusion; our results suggest two groups, one with (a) and (b), one with (a)
and (c), with no connection between (b) and (c). The linkage of (a), (b) and (c) is
usually associated with the notion of a ‘dominant hemisphere’; the failure of
one hemisphere to become completely dominant would result directly in (b),
and also in (a), since it has long been considered that speech is somehow located
in the dominant hemisphere; (c) could be thought of as arising directly from the
lack of dominance, in the way suggested by Orton, or it could be secondary, a
consequence of (b), which might make it more difficult for the child to develop
a consistent sense of direction related to his own body. However, recent findings
(Humphrey and Zangwill (1952), Penfield and Roberts (1959), Zangwill (1960))
suggest that the area concerned with speech tends to be in the left hemisphere
in left-handed as well as right-handed people, in other words that it is not
related to dominance. There is certainly a sense in which one hemisphere is
(or becomes) dominant, if only as a description of the fact that most people
are more skilful with one hand than the other, and tend to choose that hand 
(and the corresponding foot) for skilled operations. Our Motor group certainly 
lack dominance in this sense, so it looks as if motor skill in speech is related to 
speech defect ; after all, the written symbol is based on the spoken, so if the 
first symbol system is faulty, the second must suffer. The Language Group, on 
the other hand, could be considered as having a defect in a speech or language 
area, not related to dominance, or perhaps in some system concerned with a 
more general function. Their difficulties bear at least a superficial resemblance 
to Gerstmann’s syndrome, their poor reading and writing, poor arithmetic and 
sense of order, and lack of knowledge of left and right. Drew (1956) considers 
both Gerstmann’s syndrome and ‘ congenital word-blindness ’ to be based on a 
failure in Gestalt formation. In contrast, the Motor group are good at Arith- 
metic, and their knowledge of sides good for their age (c.f. Launay (1952a)). 
It is an interesting feature of this study, and worthy of further investigation, 
that the expectation that confusion of laterality and poor knowledge of sides 
would go together was not confirmed. They appear to be unrelated. 


Another source of confusion in this subject is the question of reversals in 
reading and writing. The evidence is convincing (Vernon (1957), Chap. V) that 
such reversals are related to poor reading but not to laterality. If a child is 
going to make a mistake in writing, it is quite likely that the result will be such 
that it can be interpreted as a reversal of some kind. It could be that attention 
is drawn to this kind of error to the exclusion of other kinds. This point is 
clearly brought out by Miles (1961), in his discussion of the mistakes made by 
two children with ‘ developmental aphasia.’ It is clear from his analysis that 
the disorientation which he considers to be at the root of the problem is a 
highly complex thing, involving space and time. It is interesting to note that 
his two cases of severe disability were both of consistent laterality. It is well 
known that nearly all children show reversals at some stage of learning to read 
and write (c.f., Launay, 1952). It might be that some reversals are just mistakes, 
while others are symptomatic of a genuine disorder of orientation. 


It is recognised that much of this discussion is speculative and goes beyond 
the facts. However, we feel that we have distinguished two types of develop- 
mental language disorder and that investigation with the two types in mind 
may help to clarify some of the issues. 
AN ANALYSIS OF C. W. VALENTINE’S REASONING TEST 
FOR HIGHER LEVELS OF INTELLIGENCE 


By H. J. HALLWORTH 
(Department of Education, University of Birmingham) 


SUMMARY. Answers to C. W. Valentine's reasoning test for higher levels of 
intelligence made by 286 men and 262 women were analysed and compared. 
Analysis of the item inter-correlations by the method of principal components 
gave a factor of general reasoning ability for both men and women. Upon 
rotation to the Varimax and Quartimax criteria three group factors were 
obtained which were comparable for the two groups of subjects. These factors 
are described and tentatively named. The results indicate the potential value 
of a detailed examination of the thinking processes involved in answering the 
items of this test. 


I.—INTRODUCTION. 


UPON the publication of his article on “ The use of a new reasoning test for 
selection of university and training college students ” (Valentine, 1961), Professor 
Valentine kindly made available for further analysis the test scripts of 548 
university and training college students and sixth form pupils. This article 
reports the results of the analysis. 


II.—METHOD OF ANALYSIS. 


The scripts were divided into two groups, 286 completed by men, 262 by 
women. Using an electronic computer, two similar analyses were carried out, 
one for each group. 

The test gives a score for each of sixteen different items. Including total 
test score and age, there were, therefore, scores on eighteen variables for each 
subject. Means, standard deviations and product-moment inter-correlations 
were obtained for all variables. The correlation matrix for the sixteen test 
items was analysed by the method of principal components, and the factor 
loadings were rotated to the Varimax (Kaiser, 1958) and Quartimax (Neuhaus 
and Wrigley, 1954) criteria.* 
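
The two steps just described, extraction of principal components from an item correlation matrix followed by orthogonal rotation of the loadings, can be illustrated with a short modern sketch. It is not the author's Mercury Autocode programme, which is not reproduced in the article; the function names, the use of NumPy, and the iterative SVD form of the rotation are all assumptions made for illustration. In the orthomax family used here, gamma = 1 corresponds to the Varimax criterion and gamma = 0 to the Quartimax criterion; the row normalisation of Kaiser's "normal" Varimax is omitted.

```python
import numpy as np


def principal_components(corr, n_components):
    """Return (loadings, eigenvalues) for the largest components of a correlation matrix."""
    eigvals, eigvecs = np.linalg.eigh(corr)          # eigh returns ascending order
    order = np.argsort(eigvals)[::-1][:n_components]  # keep the largest roots
    eigvals = eigvals[order]
    eigvecs = eigvecs[:, order]
    # A loading is the eigenvector element scaled by the square root of its root.
    loadings = eigvecs * np.sqrt(np.clip(eigvals, 0.0, None))
    return loadings, eigvals


def orthomax(loadings, gamma=1.0, max_iter=500, tol=1e-8):
    """Orthogonally rotate loadings; gamma=1 is Varimax, gamma=0 is Quartimax."""
    n_items, n_factors = loadings.shape
    rotation = np.eye(n_factors)
    objective = 0.0
    for _ in range(max_iter):
        rotated = loadings @ rotation
        col_ss = np.sum(rotated ** 2, axis=0)        # column sums of squares
        gradient = loadings.T @ (rotated ** 3 - (gamma / n_items) * rotated * col_ss)
        u, s, vt = np.linalg.svd(gradient)
        rotation = u @ vt                            # nearest orthogonal matrix
        new_objective = s.sum()
        if new_objective < objective * (1.0 + tol):  # no further improvement
            break
        objective = new_objective
    return loadings @ rotation
```

Because the rotation matrix is orthogonal, the communality of each item (the sum of its squared loadings) is unchanged by either criterion; only the distribution of variance among the factors alters.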


III.—MEANS AND STANDARD DEVIATIONS. 


The mean age of the men was 22 years 3 months, and of the women 19 years 
4 months. This difference was due to the fact that, at the time of testing, 
national service delayed the college entry of most men. The mean test score for 
men was 28.9, with a standard deviation of 14.5; the mean score for women 
was 24.1, with a standard deviation of 12.7. These figures are consistent with 
those given in the original article. 

On fourteen of the test items men had a higher mean score than women. 
For two items women had a higher mean score, but in neither case was the 
difference between men's and women's means significant. It is, however, of 
interest that these two items are different from all others in that they each 
combine four characteristics : 


(1) Each specifies one attribute of a logical class, using the words “ None 
but..." 


* All programmes used are for a Ferranti Mercury computer and were written by the 
author in mixed Autocode and machine code. (Hallworth, 1962.) 
(2) Each specifies another attribute of the same logical class, using in effect the 
wording “ All . . .” 


(3) Neither of the two attributes is exclusive to the class. 


(4) The answer demands an appreciation of the fact that the logical classes 
defined by each of the two attributes are not necessarily co-incident. 


Precise information regarding adult sex differences in patterns of reasoning 
is not often available. This finding is recorded because, should comparable 
information appear from another sample using this test, or from another source, 
it would surely demand further investigation of the thinking processes involved. 

As would be expected, the men had a higher standard deviation than the 
women for most items. 


IV.—CORRELATIONS WITH TOTAL SCORE. 


Correlations of total score with age were insignificant for both men and 
women, being −.05 and +.09, respectively. The negative correlation for men 
was probably due to the inclusion in the sample of a number of sixth formers who 
had recently gained university open scholarships. 


The correlation of each item with the total score is given in the table below. 
In general the figures are very satisfactory. The lower coefficients for the last 
two items are due to the greater difficulty of these items, a matter which is 
discussed again later. 


TABLE 1 


CORRELATIONS OF ITEMS WITH TOTAL SCORE. 








V.—ANALYSIS OF THE ITEMS. 


An obvious question about a test which measures reasoning ability at 
higher levels of intelligence is, does it measure one or more kinds of reasoning 
ability ? The generally accepted view is that reasoning ability depends largely 
on 'g' (Valentine, 1961, p. 228), but that there may be differences between 
inductive and deductive reasoning, verbal reasoning and numerical reasoning, 
etc. In the original article, Professor Valentine indicates that the first four 
items of the test were constructed to measure inductive reasoning ; the remain- 
ing items are concerned with deduction. We should, therefore, predict a factor 
of general reasoning ability, and a factor or factors to differentiate between the 
first four and the remaining items. 


(a) A General Factor. 


From the correlation matrix of the sixteen test items for each of the two 
groups of subjects, eight principal components were extracted. These accounted 
for a total of 69 per cent. of the variance for the men’s group and 67 per cent. for 
the women’s group. In each case the first component could be regarded as a 
general factor, accounting for 22 per cent. of the variance for the men’s group 
and 18 per cent. for the women’s group. Item loadings on this component are 
given in Table 2. 
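
Because the matrix analysed is a correlation matrix of sixteen items, each item contributes unit variance and the total variance is 16. On that assumption, which the article implies but does not state, the quoted percentages can be converted into approximate latent roots as a rough check:

```latex
\[
\lambda_1^{\text{men}} \approx 0.22 \times 16 \approx 3.5, \qquad
\lambda_1^{\text{women}} \approx 0.18 \times 16 \approx 2.9,
\]
\[
\sum_{j=1}^{8} \lambda_j^{\text{men}} \approx 0.69 \times 16 \approx 11.0, \qquad
\sum_{j=1}^{8} \lambda_j^{\text{women}} \approx 0.67 \times 16 \approx 10.7 .
\]
```

These values are derived from the rounded percentages and are not figures reported by the author.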


TABLE 2 


ITEM LOADINGS ON FIRST PRINCIPAL COMPONENT. 


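[Table body garbled in the source. Surviving loadings, item and group assignment not recoverable: .43, .44, .49; .40, .35, .29; .49, .42, (illegible), .28, .51.] 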


A first component of this kind is, of course, to be expected from this method 
of analysis. The question is, should the analysis be accepted as it stands, or 
should the axes be rotated? Since there are always many possible sets of factors 
the decision to accept one set rather than any other must be made on psycho- 
logical grounds. In the present case, acceptance of these components without 
rotation would give the general factor which has been postulated, and in this 
sense, the prediction is verified. In popular terms, this could perhaps be regarded 
as a typical British solution to the problem, extracting as large a common 
factor as possible. 


However, the remaining components were not comparable across the two 
groups, nor did they make very good psychological sense in either case. It was, 
therefore, decided to rotate the axes in an attempt to discover another set of 
factors, more of which would have good psychological meaning. 


(b) Group Factors. 

For both groups, all eight components were rotated to the Varimax 
criterion. This is claimed by Kaiser (1958) to give results approximating to 
Thurstone’s ‘simple structure,’ therefore, we may expect a more typically 
American solution. The results are given in Table 3. 

Since the method of rotation tends to spread loadings more evenly between a 
number of group factors, the general factor has disappeared, and there are now 
three factors which are fairly comparable for both groups of subjects. 


Varimax Factor 1 has its heaviest loadings on items 9, 12 and 13. In order 
to interpret this factor psychologically, we must first note that in section two 
of the test there are, as explained by Professor Valentine (1961), two kinds of 
item : first, items requiring ability to perceive a correct argument ; secondly, 
items requiring ability to perceive an incorrect argument and know why it is 
incorrect. Items 6, 9, 12 and 13 are the only items of the first type. It would 
probably be agreed that they can be answered more ' intuitively ' than can the 
other items. The relatively low loading of item 6 may be accounted for by the 
greater complexity of argument in this item. Similarly, the relatively high 
loadings of items 10 and 11 may be accounted for by the brief wording of the 
argument and relative ease of answering. If so, the best general description 
of this factor is ' intuitive reasoning ’. 
TABLE 3 


VARIMAX FACTORS FOR SIXTEEN TEST ITEMS. 


Varimax Factor II, for women, has heavy loadings on items 1-4, inclusive, 
and may be labelled ‘inductive reasoning.’ This factor was predicted. For 
men, the most comparable factor is IIA, which has, however, a low loading on 
item 2. Factor IIB is shown because this has a high loading on the missing 
item 2, also on item 11. There is no immediate logical similarity between items 
2 and 11, but it may be remarked that on Factor II for the women, item 11 has 
the highest loading after the ‘inductive’ items 2-4. Here again, further 
investigation of the thinking processes involved might yield interesting results. 


Varimax Factor III has its heaviest loadings on items 15 and 16 for both 
groups of subjects. These two items, the last of the test, are of a different order 
of difficulty from all others and yield a relatively low mean score for both 
men and women. Both involve relationships between five categories, and the 
answers involve double negation. Although Factor III was not predicted, it 
may be interpreted as relating to ability to deal with more complicated reasoning 
problems. 
(c) Further Group Factors. 

The Quartimax method of rotation has commonly been used for cognitive 
data on the assumption that it will bring out any common factor (Harman, 
1960, p. 301). It was, therefore, used in the present instance in an attempt to 
obtain a general factor accompanied by other factors with psychological 
meaning. In Table 4 are listed the principal Quartimax factors for both 
groups of subjects. No general factor was obtained. Comparison with the 
Varimax factors shows a remarkable similarity. For both men and women, 
the sets of Varimax and Quartimax factors are almost precisely the same, 
differing by little more than the limits of precision written into the computer 
programmes. 


TABLE 4 


QUARTIMAX FACTORS FOR SIXTEEN TEST ITEMS. 


VI.—CONCLUSION. 
As is usual with factorial analyses, there is more than one set of factors 
which have psychological meaning. Accepting the principal components as 
they stand, there is a general factor of reasoning ability accounting for approximately 
one-fifth of all the variance. Rotating to either the Varimax or Quartimax 
criterion, there are three factors which are related to features of the test 
items and to the reasoning abilities which they apparently demand. These 
factors, and certain differences between scores for men and for women, suggest 
that it would be valuable to have a detailed examination, possibly along the 
lines used by Duncker (1945) and Peel (1962), of the thinking processes involved 
in answering the items of this test. 
THE DEVELOPMENT OF CHILDREN’S IDEAS ABOUT 
COUNTRY AND NATIONALITY 


Part I: THE CONCEPTUAL FRAMEWORK. 


By GUSTAV JAHODA 
(University of Glasgow) 


SUMMARY. A random sample of children aged 6-11 in four Glasgow schools 
was interviewed and tested, two situated in middle-class and two in working- 
class districts. The formation of concepts relating the series of units Glasgow— 
Scotland-Britain is traced in terms of both verbal formulation and spatial 
representation. The difficulty for younger children of such concepts as ‘ town’ 
and ‘country’ is demonstrated. The emergence of national identification is 
described, indicating the major sources of initial confusion. 


I.—INTRODUCTION. 


THIS study was originally inspired by Piaget (1928, 1951), who attempted to 
establish developmental stages for children's concepts of country and nationality. 
Unlike most other replications concerned with more clear-cut spheres of thought, 
such as the work of Lovell (1961) on mathematical and scientific concepts, the 
findings were in several fundamental aspects at variance with those of Piaget. 
The theoretical problems raised thereby will be dealt with separately. Here the 
aim is a two-fold one: to analyse the gradual emergence of concepts and 
relationships, and to offer tentative developmental norms for children of 
contrasting socio-economic background. 


II.—METHODS. 


(1) The Sample. The first stage of sampling was a ‘ purposive’ one: 
two primary schools were selected from well-to-do middle class areas, where 
parental occupations are mainly clerical, professional and administrative ; 
in contrast, two other schools were located in a high-density and now much 
dilapidated working class district, where semi- and unskilled occupations 
predominate. 

Within the schools, a sampling frame was constructed from the registers, 
pupils being divided according to age and sex irrespective of their form. The 
minimum age was fixed at six, because it had been obvious from preliminary 
trials that it would have been difficult to administer the somewhat lengthy 
procedure to the youngest children; and it was felt that uniformity was 
essential if valid comparisons were to be made. After excluding any children 
not of Scottish parentage, selection was done at each age-level by means of a 
table of random numbers, three boys and three girls always forming the unit. 
This meant that the sample was widely scattered, only a small number being 
taken from any one form. The object of this was to minimize the possibility of 
communication among the participants, and there is every indication that this 
was satisfactorily achieved. The general plan is, perhaps, easier to follow when 
set out schematically : 


    Middle Class District (M)            Working Class District (W) 
    School A        School B             School C        School D 
    Boys  Girls     Boys  Girls          Boys  Girls     Boys  Girls 


    (Ages: 6-, 7-, 8-, 9-, 10-, 11-) 
There were thus : 
2 (social classes) × 2 (schools) × 2 (sexes) × 6 (age levels) × 3 (individuals) = 144. 
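
The sampling design can be checked by enumerating its cells. The following is only an illustrative sketch: the variable names are hypothetical, and the schools are represented by index within each social class rather than by the letters A to D.

```python
from itertools import product

SOCIAL_CLASSES = ("M", "W")          # middle-class and working-class districts
SCHOOLS_PER_CLASS = 2                # two schools in each type of district
SEXES = ("boy", "girl")
AGE_LEVELS = (6, 7, 8, 9, 10, 11)
CHILDREN_PER_CELL = 3                # three boys and three girls at each age level

# One cell for every combination of class, school, sex and age level.
cells = list(product(SOCIAL_CLASSES, range(SCHOOLS_PER_CLASS), SEXES, AGE_LEVELS))

total_children = len(cells) * CHILDREN_PER_CELL
children_per_class = total_children // len(SOCIAL_CLASSES)
```

The enumeration yields 48 cells and hence 144 children, 72 from each type of district.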

Unless otherwise specified, all subsequent tables will be based on this total 
of 144 children. Although sex had to be one of the controlled variables, 
significant sex differences failed to emerge. Hence results will not be shown 
separately for boys and girls. This does not necessarily mean that there are no 
contrasts between the sexes in this field ; in fact, some of the differences were at 
least suggestive. However, in view of the relatively small numbers, dictated by 
the time-consuming nature of the interviews, it was felt that one would not be 
justified in putting much stress on this aspect. 


(2) Interviewing and Testing. The children were seen individually. Each 
session lasted between 35 and 70 minutes, the mean being about 45 minutes. 
In order to arouse and maintain the interest even of the younger children, the 
procedure was devised so as to present considerable variety. 

In an investigation of this kind, the handling of the situation is of para- 
mount importance. The procedure was not started until the children had been 
put at ease by some preliminary chatting, and, in fact, an atmosphere of 
informality was aimed at throughout. The schedule itself was intended as a 
guide, not a straight-jacket. The questions in the schedule were certainly always 
used in their exact wording ; but the possibility of purely verbal misunderstand- 
ings was constantly kept in mind. Hence, particular questions were often 
re-phrased and put again, taking care at the same time not to introduce any 
suggestions. The purpose was always that of exploring the degree of clarity 
and the range of various concepts for the child. 


Naturally, this is a difficult task, and some influence of the observer upon 
what is being observed is unavoidable. As has been rightly said,* the very 
attempt to ascertain the child's capacity for comprehension is itself a learning 
situation. When a child is faced with a pointed question concerning some 
relationship for the first time, this may be the very stimulus which suddenly 
enables them to perceive how the elements fit together, so that a correct answer 
is produced after reflection. This did occasionally happen, but any error arising 
thereby will, from the present stand-point, be in the conservative direction. 
In other words, it will lead to an overestimate of the capacity to grasp a particular 
concept at any given age level. 


III.—DEVELOPMENTAL FRAMEWORK. 
(i) Geographical Stages. 

There is general agreement that a child’s intellectual grasp of his environ- 
ment begins in his immediate vicinity and only gradually extends outwards. 
One can think of this as a series of concentric circles, with the child at the 
centre. Such a model, admittedly far too simple and neat to be a realistic 
portrayal of the child’s perspective, may, nevertheless, serve to define the first 
task : it is that of determining the extent of each child’s effective range. The 
word ‘ effective’ means in this context the range within which the child is 
capable of correctly apprehending relationships. 


One of the chief measures to be introduced here is what with some diffidence 
may be called the child's geographical stage, or ' G-stage ' for short. This is 
derived from the verbally formulated relationships, in response to questioning. 


* LOVELL and OGILVIE (1960). 
The measure was arrived at after attempting, without success, to apply 
Piaget’s scheme. The one adopted here involves the use of what might be 
called ‘ natural ’ units for the purpose of categorization. Whilst not eliminating 
the element of arbitrariness entailed in the slicing up and pigeon-holing of what 
is essentially a continuum, it does at least tend to reduce it. There were, of 
course, a number of marginal cases, but the large majority could be easily and 
unambiguously classified. The categories, or G-stages, were as follows : 


G.I: No conception of Glasgow as a unitary whole. 

G.II: Conception of Glasgow as a unitary whole, but no conception of it as 
part of Scotland. 

G.III: Conception of Glasgow as part of Scotland, but no conception of 
Scotland as part of Britain. 

G.IV: Glasgow-Scotland-Britain relationship correctly expressed. 
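
The four categories form an ordered sequence in which each stage presupposes the one before it. A minimal sketch of that ordering follows; the function and its three yes/no arguments are hypothetical simplifications, since in the study the classification rested on the interviewer's judgment of the whole pattern of replies rather than on three separate checks.

```python
def g_stage(glasgow_as_whole, glasgow_in_scotland, scotland_in_britain):
    """Return the geographical stage implied by three yes/no judgments."""
    if not glasgow_as_whole:
        return "G.I"      # no conception of Glasgow as a unitary whole
    if not glasgow_in_scotland:
        return "G.II"     # Glasgow is a whole, but not yet part of Scotland
    if not scotland_in_britain:
        return "G.III"    # Glasgow in Scotland, but Scotland not yet in Britain
    return "G.IV"         # full Glasgow-Scotland-Britain relationship
```

For instance, a child judged to place Glasgow in Scotland but unable to relate Scotland to Britain would fall in G.III under this scheme.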


The ideas of children at these four stages will now be illustrated with 
examples. The questions by the interviewer are shown in brackets; the 
symbols after each quotation stand for type of school and age. 


G.I. 

It was characteristic of children at that stage that they seemed to think of 
Glasgow as some kind of vague entity near-by, though not usually including their 
own immediate geographical location. 


(Where is Glasgow?) 

Beside the playground, in X street. (W6) 

It's up by the park there—you go round the corner. (W6) 
Down in Y avenue. (M7) 

I don't know the way to Glasgow. (W7) 


Although nearly all were familiar with the word ' Scotland,' their notions 
of this concept were apt to be nebulous in the extreme. Several main tendencies 
are distinguishable : (a) the fusion of Scotland with Glasgow; (b) the belief 
that Scotland is somewhere outside Glasgow ; (c) random associations called up 
by the word ; and also combinations of these. 


(What is Scotland? Where is Scotland?) 

I get muddled up—is that not Glasgow? Yes, it’s just Glasgow. (W6) 

My granny stays in Scotland. (Do you go and see her?) Yes, I go up at 
3 o'clock every day. (W7) 

I was born in it—it’s the name of the town. 
It's just outside Glasgow. (W8) 

Scotland is the capital of Edinburgh. It's in Glasgow. (M7) 

Scotland is a street. (Yes, Scotland Street, but have you heard of a place 
called Scotland?) Yes, Scotland the Brave, it's up in the Highlands. 
(W : 

The Mid ee the telly—I saw it last night. (Where is Scotland?) 
Away far. (W6) 


The majority could say nothing about Britain at all, beyond perhaps the 
fact that they thought they had heard of it. If it was conceived as a place at 
all, it was clearly a very remote one. 

(What is Britain? Have you heard of Britain?) 

I’ve heard the word, but I don’t know what it’s like. (W6) 

Yes. (What do you think it is?) Our next-door neighbour went to it and 
brought some dolls home. (M7) 

It was there that the men fought the war. (W8) 


In general, it may be said that at this first stage the organization of 
geographical concepts is minimal, responses being mainly in the nature of 
diffuse and undirected associations. 


G.II. 


The chief difference between this stage and the preceding one is that the 
children’s answers showed an awareness of being within the unit called Glasgow, 
which was no longer thought to be located somewhere else. Responses to the 
question “ Where is Glasgow? " were fairly stereotyped, usually taking the 
form “ It's here, where we are." There were, moreover, a few replies which may 
be regarded as transitional to the subsequent stage, though they undoubtedly 
belong to G.II because of their inherent contradictions. 


Glasgow is in Scotland. (Where is Scotland ?) 

Scotland is a country outside Glasgow. (W7) 

Glasgow is in the middle of Scotland. (What is Scotland?) It’s a place 
where the people play the bagpipes. (Where?) Away in the Highlands. 
(M7) 

The remainder’s ideas about Scotland were largely similar to those of G.I, 
including the notions that Scotland is ‘in England’ or ‘ part of America’ 
(both M6). A higher proportion ventured some statement about Britain, but 
the level of accuracy was usually not much higher. 


(What is Britain?) 

It’s a city in England. (W7) 

It's a big town: mountains are there, and they climb on it. (W7) 

I don't think I’ve been there. (M9) 

It's a city in Scotland. (M9) 

The major advance, therefore, is confined to the consolidation of the concept 
of Glasgow as an entity in which one lives and which constitutes one's 
surroundings. 


G.III. 


Here the stock answer became “ Glasgow is in Scotland,” or words to that 
effect. Scrutiny of the total response patterns suggests, however, that one 
must be careful not to read too much into this. It is true that replies did not 
go astray until after the question “ Where is Scotland? ", but as will shortly 
become apparent the foundation probably tends to be somewhat shaky. 


(What is Scotland? Where is Scotland? What is Britain?) 

Scotland is a town made up of countries and towns. It’s up north. Britain 
is a place where you live in, a big, big, piece of land surrounded by 
water. (M7) 

Scotland is a country. It’s in the world. Britain is really the whole 
continent. (M8) 

Scotland is a country. It's beside Wales and Ireland. Britain is mostly 
called Great Britain; it's sort of Scotland—another name for it. 
(M8) 

Scotland is a country. Britain is the United State. (What do you mean? 
Long pause.) Of America. (W9) 

Scotland is a country. (Where?) Can't say. Britain is a lot of different 
countries. (What countries?) Glasgow, London, France. (W11) 


When one considers these and similar answers the phrases “ Glasgow is in 
Scotland " and “ Scotland is a country " begin to have a suspicious air of an 
empty formula. Additional evidence shows that this is not always justified, yet 
in the main one seems to be faced with the parrot recital of terms learnt without 
adequate understanding. In other words, grasp of relationships has failed to 
keep pace with the acquisition of a new vocabulary. It is thus, perhaps, 
doubtful whether G.III should really always be accepted as a genuine stage 
regarding concept formation; it might be safer to look on it as a temporary 
transition between G.II and G.IV. 


G.IV. 

This can be dealt with very briefly. In order to qualify, a child had to 
convey some understanding of Britain as a well-defined unit. The first of the 
quotations below represents roughly the acceptable minimum. 


Glasgow is in Scotland. Scotland is a country in Great Britain. Britain 
is some countries joined together. (M6) 

Glasgow is in the Central Lowlands of Scotland. Scotland is a country. 
It's just north of England. Britain is Scotland, England, Wales and 
Northern Ireland. (W11) 

With G.IV children have achieved the normal adult standard in Britain. 


The levels reached by various sub-groups are summarized in Table 1. In 
this and most subsequent tables results are presented separately for three age- 
ranges, comprising twenty-four children each; within age-ranges there is a 
further sub-division into M and W schools. This produces a general picture of 
developmental trends as influenced by age and socio-economic background. 


TABLE 1 


GEOGRAPHICAL STAGES REACHED BY TYPE OF SCHOOL AND AGE. 








There is a highly significant difference (P less than .001) between the 
distribution of stages in M and W schools. It may also be noted that G.III 
contains a smaller total than the stages preceding and following it, thus affording 
some support to the view that it might well be a relatively brief transition stage. 


(ii) Spatial Representation of Geographical Relationships. 

The procedure used in determining G-stages is open to the objection that it is 
conducted entirely at the verbal level, and in fact, some doubts have been 
expressed on this score as regards G.III. A special test dealing with the appro- 
priate spatial concepts was, therefore, devised. It was administered in two parts, 
the first being a preliminary training session designed to familiarize the children 
in a simple manner with the symbolic relations involved. 
FIG. 1 
MATERIALS USED IN THE SPACE TEST. 

[Pieces shown, labelled by their meaning in the practice trial and in the test proper: 
BLACK (School/Britain); WHITE (Not used/Scotland); WHITE (Room/England); 
BRASS (Subject/Glasgow).] 


The material employed, seen in Fig. 1, consisted of one black and two white 
plastic boards, and a brass disc. For the try-out the smaller of the white pieces 
was left out, and the child asked to reproduce the immediate situation, i.e., 
self-in-room-in-school; this is, of course, analogous to Glasgow-in-Scotland-in-Britain. 
Any child failing in this preliminary task was given a detailed explanation 
and demonstration with the aid of the materials. This was necessary with 
12.5 per cent. of all M children and 37.5 per cent. of W children (P less than 
.001). 
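
The reported significance level can be checked from the percentages alone. The sketch below rests on two assumptions not stated at this point in the text: that each social class contributed 72 children (half of the 144), so that the percentages correspond to 9 and 27 children respectively, and that the test was a Pearson chi-square on the resulting 2 × 2 table without continuity correction. The function name is hypothetical.

```python
import math


def pearson_chi_square_2x2(a, b, c, d):
    """Pearson chi-square (1 d.f., no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # For one degree of freedom the upper-tail probability is erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


CHILDREN_PER_CLASS = 72                              # assumed: 144 children, two classes
m_needing_help = round(0.125 * CHILDREN_PER_CLASS)   # 12.5 per cent. of M children
w_needing_help = round(0.375 * CHILDREN_PER_CLASS)   # 37.5 per cent. of W children

chi2, p_value = pearson_chi_square_2x2(
    m_needing_help, CHILDREN_PER_CLASS - m_needing_help,
    w_needing_help, CHILDREN_PER_CLASS - w_needing_help,
)
```

Under these assumptions the statistic comes to 12.0 on one degree of freedom, which lies beyond the .001 level and so agrees with the figure given.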


The decisive test came later, immediately following a musical test, so as to 
minimize carry-over from the questioning. This time the child was given all 
four pieces, with the information that the largest one was supposed to be 
Britain. The task was (a) to decide which of the others represented Glasgow, 
Scotland, and England, respectively, and (b) to assemble them correctly. The 
criterion for failure under (a) was simply not choosing the small brass disc for 
Glasgow ; only a child making this mistake, whatever the subsequent perform- 
ance, was classed as S.I. 


In order to qualify for success in positioning, three conditions had to be 
satisfied : brass disc on top of one of the white pieces; white pieces side by 
side, not on top of one another; white pieces within the area of the black one. 
Anything not conforming to this was again classed as failure, S.I. The most 
common failures are illustrated in Fig. 2. Among the youngest children, side- 
by-side placement of all the pieces often happened (S.I.—A) ; overlapping 
sometimes occurred (S.I.—B), or simple building up (S.I.—C), which was an 
attempt to reproduce the earlier practice trial whilst coping with the additional 
piece. The most frequent failure of all was something like S.I.—D, namely, the 
brass disc flanked by the two white pieces; from the conceptual point of view, 
this suggests that such children thought of Glasgow and Scotland as two separate 
units, without as yet any notion of spatial inclusion. 


All children fulfilling the above conditions were scored as successes (S.II), 
ignoring both identification of white pieces and compass orientation. However, 
a further minor distinction was made between children who had at least one of 
these wrong (S.II₁), and those who were right in both (S.II₂). In the example of a 
successful solution shown in Fig. 2, the location of the Glasgow disc within 
Scotland was obviously carefully chosen; but the exact positioning was not 
considered in the scoring, as it would have been difficult to know where to draw 
the line. 
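
The scoring rules for the space test amount to a short decision procedure, which may be sketched as follows. The class, its field names and the wording of the returned labels are hypothetical and introduced only for illustration; the conditions themselves are those stated in the text.

```python
from dataclasses import dataclass


@dataclass
class Arrangement:
    """One child's performance on the space test (field names are illustrative)."""
    disc_chosen_for_glasgow: bool      # part (a): brass disc picked as Glasgow
    disc_on_a_white_piece: bool        # disc placed on top of one white piece
    white_pieces_side_by_side: bool    # white pieces not on top of one another
    white_pieces_within_black: bool    # white pieces inside the black board
    white_pieces_identified: bool      # Scotland and England correctly assigned
    compass_orientation_correct: bool  # pieces correctly oriented


def spatial_stage(a):
    """Classify an arrangement as a failure (S.I) or one of the two grades of success (S.II)."""
    if not a.disc_chosen_for_glasgow:
        return "S.I"                   # failure under (a), whatever follows
    positioned = (a.disc_on_a_white_piece
                  and a.white_pieces_side_by_side
                  and a.white_pieces_within_black)
    if not positioned:
        return "S.I"                   # failure in positioning
    if a.white_pieces_identified and a.compass_orientation_correct:
        return "S.II (both right)"
    return "S.II (at least one wrong)"
```

The exact position of the disc within the white piece plays no part, in keeping with the remark that it was not considered in the scoring.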


Results are set out in Table 2, and it appears that the contrast between M 
and W is as sharp as it was in the case of G-stages, the overall difference being 
highly significant (P less than .001). Closer scrutiny reveals a distinct age 
pattern: at 6-7 there is little to choose between M and W; but whilst M 
children progressed steadily from then onwards, W ones exhibited no appreciable 
change until after the age of 9, and then they were still far from having 
caught up. 


Another observation may be made concerning the relative frequencies of 
S.II₁ and S.II₂ : it looks as though the former were less stable. In other words, 
once the basic nature of the Glasgow-Scotland relationship is grasped, compass 
orientation and size soon come to be sorted out as well. 
FIG. 2 
TYPES OF FAILURE AND SUCCESS IN THE SPACE TEST. 


TABLE 2 


SPATIAL STAGE REACHED BY TYPE OF SCHOOL AND AGE. 





(iii) Comparison of the Two Approaches. 


This is provided in Table 3, which shows a regular and orderly progression. 
The relationship is most clearly apparent from the bottom row, which gives 
the percentages of complete failure on the space test for each G.-stage. The 
first sizeable reduction in the failure rate occurs at G.III, and there is a 
spectacular drop between G.III and G.IV. 


TABLE 3 


RELATIONSHIP BETWEEN GEOGRAPHICAL AND SPATIAL STAGES. 











Per cent. S.I. for each 


96% 95% 72% 





The comparison offers a satisfactory confirmation of the interpretation 
already put forward. Stage G.III is clearly one of transition, in the course of 
which the verbal statement “ Glasgow is in Scotland ” remains largely an empty 
formula, devoid of an adequate conceptual appreciation of the part-whole 
relationship involved. On the other hand, a large majority of those able to 
give expression to the broader Glasgow-Scotland-Britain pattern have, in fact, 
achieved such a grasp. 

Before leaving this important section, attention should also be drawn to the 
wide variations in ages at which a particular stage is attained, which emerge 
from Tables 1 and 2. It is noteworthy, moreover, that only half the W children 
of 10 and 11 have reached an adequate concept of spatial relations, a fact which 
tends to be concealed by the effects of rote learning. 


(iv) The Concept of ‘Large Town.’ 
It may have been observed in connexion with some of the quotations that 
children were apt to describe countries such as Scotland and England as 
‘towns.’ The converse error was also made occasionally, though there were 
fewer opportunities for this; a confusion of the kind is certainly implicit in 
such statements as “ Scotland is the capital of Edinburgh." 

Immediately following the topic of Glasgow, children were asked to name 
some other ‘really big towns.’ This was an attempt to discover the range of 
notions conveyed by the phrase at different ages. In evaluating the findings, 
summarized in Table 4, one limitation of this approach must be kept in mind: 
the total number of responses given by each child was not controlled, and this is 
a possible source of variation. However, as the general pattern seems to make 
good sense, it cannot have been of great importance. 


Among the youngest children, nearly one-third were unable to give any 
answer, or mentioned street names; the remainder of this group produced a 
large and ill-assorted variety of responses, only two of them entirely correct. 
In the intermediate age-group, most of the crudest errors had dropped out, 
leaving mainly a confusion between small towns, large towns and countries. 
Finally, at the ages 10-11, the chief remaining mix-up was between small and 
large towns. As before, M children were generally greatly in advance of W 
ones (P less than .001). 


TABLE 4 


TYPES OF RESPONSES FOLLOWING REQUEST TO NAME ‘REALLY BIG TOWNS.’ 


10-11 | 6-7 


Large towns only* 
Large and small towns 
Countries, large towns, and small towns 
Countries, small towns and streets 
Streets only 
Unable to name anything 








On the basis of these findings, one may hazard some inferences about elem- 
ents of the thought processes involved. With the least mature group, the 
question evokes just any kind of geographical names that happen to be 
familiar to the child ; in other words, the concept of large towns is so diffuse 
that it is taken to cover a variety of units, from streets to countries. If these 
concepts have not yet become differentiated, one can hardly expect the child to 
establish any kind of relationship between them, as was the task with regard 
to Glasgow, Scotland and Britain. 


Gradually some boundaries begin to take shape, and the child moves into 
a zone of transition. Whilst the difference between towns and countries is not 
fully understood, some children are able to produce at least an appropriate 
verbal formula to designate the relationship (e.g., “Glasgow is in Scotland”) ; 
others, though fewer, are beginning to see the spatial relations. 


* A ‘large’ town was defined as having a minimum population of 150,000. 
The final stage, in terms of the classification adopted here, is the achievement 
of a sharp discrimination of units. This permits both an insightful verbal 
formulation, as opposed to parrot recital, and correct abstract representation of 
spatial arrangements. 

If such an interpretation is valid, it follows that the notion of what con- 
stitutes a ‘really big town’ ought to be closely associated with both G and S 
stages. Table 5 indicates that this is, in fact, the case (P less than .001). 


TABLE 5 


TYPES OF ‘REALLY BIG TOWNS’ RESPONSES IN RELATION TO G AND S STAGES. 


Response Categories 


and/or large All other 
towns and responses 
countries 








Numbers in each category 
Per cent. in G III and G IV 
Per cent. in S II 





IV.—COUNTRY AND NATIONALITY. 
(i) Spontaneous National Identification. 


The first question asked when the interview proper started was “ What 
are you? " The object was to elicit the dominant manner in which children 
tended to view themselves, before anything else could introduce a bias; in 
particular to find out to what extent, if any, nationality entered into what are 
commonly called children's ' self-images.’ 


Results can be put very briefly : some nine-tenths confined themselves to 
their sex-role, i.e., they said “ I am a boy/girl.” Further probing brought out 
mainly qualifications of the statement already made (e.g., ‘‘ I am a schoolboy,” 
“I am a good girl”), or with older and/or brighter children, some superordinate 
categorization like ‘person’ or ‘human being.’ Out of the total sample 
only just over 3 per cent. described themselves as ‘ Scottish,’ all being from M 
schools. It may also be mentioned that the same proportion referred to their 
religion. 

This may be compared with the findings of a study by Lambert and Klineberg 
(1959) carried out about the same time, and using the identical question. 
Three sets of English children (among other nationalities) aged 6, 10 and 14, 
respectively, were included in the survey. Omitting the oldest, the combined 
proportion of the others who called themselves English came to about 3 per 
cent., almost exactly the same. This is, perhaps, somewhat surprising, as one 
might have anticipated that Scottish children would show a relatively greater 
awareness of their nationality. It is possible that such a difference would appear 
later, as the 14-year-old English children showed a great upsurge in the frequency 
of giving their nationality. 
(ii) General Inquiry about Country and Nationality. 


For most adults the phrase ' my country ’ has powerful emotional connota- 
tions. The problem of the growth of such sentiments is merely touched upon 
in this investigation, which focusses primarily on cognitive development. It was 
concerned, however, with the emergence of understanding of the object to which 
such sentiments refer. 

Before Scotland or Britain had been mentioned by the investigator, 
children were asked “What is your country?” It is apparent from Table 6 
that the youngest children often had no idea, some naming either their street 


TABLE 6 


DISTRIBUTION OF ANSWERS TO THE QUESTION ‘WHAT IS YOUR COUNTRY ?' BY TYPE OF 
SCHOOL AND AGE, 








or local district. A frequent answer was ‘ Glasgow,’ and this may have been 
influenced by the preceding talk about the home town. With rising age, 
‘Scotland’ gained increasing prominence, whilst the small proportion who 
named Britain remained fairly constant. Differences between M and W schools 
were significant (P less than .01). The general pattern fits in well with the view 
that children begin with an awareness of their immediate locality, which 
becomes progressively extended outwards, pari passu with a sharpened 
discrimination concerning such geographical terms as ‘town’ and ‘country.’ 

The question “ What is your nationality ? ’’ held few surprises. Among M 
children only one in the 6-7 group understood it, as compared with one-third of 
those aged 8-9 and three-quarters of the oldest ones ; corresponding figures for 
W children were, respectively, nil, a single one and one-third. Most of those 
who understood the question gave a correct reply, i.e., ‘Scottish’ or ‘ British,’ 
but a few said ' England ' or ‘ English.'* The probable reason for such answers 
will be encountered shortly. 


(iii) Understanding of Relationships. 


After reference had been made to Scotland and Britain, children were 
asked specifically whether they were Scottish, and whether they were British. 
As is evident from Table 7, a majority of all ages gave an affirmative reply to 
at least one of these questions. Moreover, there is no significant difference 
between M and W, indicating that a verbal knowledge of nationality is acquired 
at roughly the same rate in both kinds of environment. Most of those aged 10-11 
acknowledged that they were both Scottish and British ; the problem is, how- 
ever, to what extent this corresponded to a grasp of the implicit part-whole 
relationship. 


* It will be remembered that children not of Scottish parentage had been excluded 
from the sample. 
TABLE 7 


DISTRIBUTION OF COMBINED ANSWERS TO THE QUESTIONS ‘ARE YOU SCOTTISH? ARE 
YOU BRITISH?’ BY TYPE OF SCHOOL AND AGE. 


‘No’ or ‘Don't know’ to both 

Scottish ‘Yes’ and British ‘No’ or ‘Don't know’ 

British ‘Yes’ and Scottish ‘No’ or ‘Don't know’ 





In order to test this, a follow-up was used with children who had said they 
were both, and with those who were positive about one but uncertain about the 
other. All these were asked about the possibility of being Scottish and British 
at the same time, and invited to explain their reasons. 


The outcome, summarized in Table 8, is instructive. The most frequent 
reason offered for rejecting the possibility was mutual exclusiveness—'' you 
can't be two different things at once." The other prominent feature, used as a 
justification on either side, was language. In fact, language and nationality 
were commonly mixed up, which probably led some of the children to say 
‘English’ when asked about their nationality, as mentioned earlier. A few 
quotations will illustrate the prevalent state of confusion, the first pair below 
having answered ‘ Yes ' and the second ‘ No’: 


British is your language and Scottish is your accent. (M8) 


Down in England you speak British and here in Scotland you still talk 
British. (W10) 


I speak English—I’m really Scottish. (M6) 
You can't talk Scottish and British at the same time. (W9) 


The first two happen to be consistent, though obviously without any 
glimmer of understanding ; the last two, having declared previously that they 
were Scottish and British, had changed their position, yet there was no indica- 
tion that they were aware of it. On the other hand, a small number of children 
did show such an awareness. For instance, one decided “You can't talk 
different ways,” and reverted to Scottish only ; another, possibly with a notion 
of Gaelic at the back of his mind, came to the conclusion that he was British 
but not Scottish because '' All the Scottish speak a different language to us ” ; 
one altered his answer from ' No’ to ' Yes,’ explaining “ Well—you can speak 
all the different languages." Lastly, an 11-year-old, after an initial emphatic 
' No,' hesitated and finally reached the correct answer. This is an example of 
the interview as a learning situation, in the course of which new insights may be 
gained. 
TABLE 8 


IDEAS ABOUT THE RELATIONSHIP BETWEEN ‘SCOTTISH’ AND ‘BRITISH.’ 


Can one be both ? Explanation given 





' Can't be two things at same time’ 
Different language 





Same language 
Other 





The miscellaneous answers contained some curious specimens. Some 
thought that Scottish and British were just different names for the same 
country ; one explained that his father was Scottish and his mother British, 
and a 6-year-old hit upon an ingenious solution : ' One week I'm Scottish and 
the next week I'm British." 


The lesson of this follow-up is that replies to the nationality questions 
cannot be taken at their face value. When challenged, about a third of the 
children said that one could not be British and Scottish at the same time, 
thereby mostly contradicting their earlier responses and showing that these had 
not been rooted in a genuine understanding ; others continued to affirm the 
possibility of being both, yet without being able to provide an adequate 
justification ; and less than half were able to demonstrate their grasp of the 
underlying relationship. 
PSYCHO-LOGIC: A POSSIBLE ALTERNATIVE TO 
PIAGET'S FORMULATION 


By G. HARRY McLAUGHLIN 
(Northampton College of Advanced Technology, London) 


Summary. The limitations of children's reasoning observed by Piaget can be 
described by a restricted logical system, called a psycho-logic. It is shown that 
psycho-logic can be simply represented by a small set of finite Boolean algebras. 
These are interpreted to suggest that limitations of reasoning correspond to 
limitations of immediate memory or attention. 


PIAGET (1947) is particularly noted for the discovery that children behave as 
if their thinking conformed to the rules of a psycho-logic, meaning a logical 
system too restricted to permit certain valid deductions to be made. He also 
observed that a child's ability to reason progresses through an invariable 
sequence of four main stages, outlined below. He accounts for this sequence 
by suggesting that the development of higher stages of reasoning depends upon 
the child achieving concepts which are more and more abstract. 

Nearly all the generalisations on the development of reasoning given by 
Piaget (1937), Piaget and Inhelder (1941, 1959) and Inhelder and Piaget (1955), 
have been supported by numerous careful investigations, such as those fre- 
quently reported in this Journal by Peel and his co-workers. 

To describe the four stages of intellectual development, Piaget formulates 
four levels of psycho-logic. Each level is a logic so restricted that it yields only 
those inferences which can be made by a subject at the corresponding stage. 
Furthermore, Piaget advances the hypothesis that if, at any level, a formula 
can be derived from another formula in psycho-logic, a subject at the end of the 
appropriate stage can carry out cognitive processes corresponding to both 
formulas. 

Psycho-logic has been set out in detail by Piaget (1949); particular 
aspects of it have been exhaustively treated by Piaget (1942, 1952) ; and it is 
summarised by Piaget (1947, 1953). The last-mentioned volume has an 
introduction by Mays explaining with admirable clarity the logical terms used 
in psycho-logic. Unfortunately, as Beth (1950) noted, Piaget’s own expositions 
of his system contain serious confusions and are consequently hard to follow. 
Grize (1960) has expressed the system in a rigorous form, but this is even more 
formidable to the non-logician, and few of its terms appear to have psycho- 
logical referents. 

The difficulty of Piaget’s system seems to stem from his attempt to translate 
directly into logical terms the psychological notion of increasing conceptual 
abstractness. This gives the system a structure different from that of con- 
ventional logics. 

However, it will be shown that a psycho-logic can be adequately formu- 
lated in terms of the algebra of classes, which is explained in most introductions 
to modern logic (e.g., Langer 1953). It will also be shown that each level can 
be simply defined by the number of different classes that are distinguished 
simultaneously. Furthermore, this number can be directly identified with a 
clearly measurable psychological characteristic—memory span. 
62 Alternative to Piaget’s Psycho-logic 


Briefly, the argument is that a subject solving an intellectual problem must 
retain a number of items at the same time, the number depending on the 
logical complexity of the problem ; and that the number of items a child can 
retain simultaneously increases as he grows older ; therefore, as the child matures, 
he can solve more complex problems. In other words, each of the four stages 
in the development of thought is characterised in part by the number of items 
that a child can retain at the same time. It should be noted that increases in 
retentivity are claimed to lead to changes only in the logical aspect of thinking : 
no doubt changes in the quality of thinking in its other aspects involve further 
factors. Furthermore, an ability to retain a certain number of items is claimed 
only to be a necessary condition for making certain kinds of inference : it is not 
proposed as a sufficient condition. 

There are at least two ways in which the number of items retained simul- 
taneously could increase with age. The older child might either be able to 
remember more items after a single presentation, or be able to attend to more 
items while they are being presented. Even more probably, memory capacity 
and attention capacity increase together, perhaps because they are two aspects 
of the same process. For convenience of exposition, a child's growing ability 
to consider a certain number of items simultaneously will usually be treated 
simply as a function of enlarging immediate memory capacity, but it must be 
emphasised that this notion is not fundamental to psycho-logic. 

The size of memory span is greatly affected by redundancy of the stimulus 
material. That is to say, if an item in a list to be recalled is deducible from other 
items, a subject's memory span will be increased by that redundant item. 
However, we will be concerned with items that are by definition non-redundant : 
these G. A. Miller (1956) has named chunks. Each digit in a digit span test 
forms a chunk, unless the subject is capable of mentally combining the digits 
into groups. Thus, from the norms of digit span tests (e.g., Wechsler 1949, 1955) 
we can say that the immediate memory can simultaneously retain two chunks 
at about 2½ years of age, four by 4½, and seven by 13½, falling to six at about 
55 years of age. Thus, immediate memory capacity increases and declines in 
the same way as intelligence, correlating between .42 and .62 with full Wechsler 
Intelligence Scale scores. The reason why the correlation is no higher may be 
that, although a given stage cannot arise until the capacity to retain enough 
chunks has developed, it is not consistently manifested until much later, when 
the child has learnt to combine and separate the chunks mentally. 

Let us call that part of his environment which a subject is considering at any 
moment his universe. This universe can be characterised by the algebra of 
classes. In this logical system, any collection of entities, physical or mental, is 
called a class. A class can consist of a single entity. A class with no members 
(e.g., the class of unicorns) is said to be empty. There can also be classes formed 
of classes. If A and B are classes, their logical product is the class of entities 
in both A and B ; and their logical sum is the class of entities in A or B or both. 
Classes which are parts of a larger class are called its subclasses. The universe 
class comprises all the classes relevant to a given proposition. The complement 
A’ of a class A consists of all entities in the universe class that are not in A. 
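
The class operations just defined can be illustrated with a brief sketch using Python sets. The entities and the two example classes are hypothetical and chosen only for illustration; the function names are likewise assumptions, not terms from the algebra itself.

```python
# Hypothetical universe class: every entity relevant to the proposition.
universe = frozenset({"swan", "raven", "sparrow", "cow"})

# Two illustrative classes (assumed memberships, for demonstration only).
A = frozenset({"swan", "raven"})   # e.g. "large birds"
B = frozenset({"raven", "cow"})    # e.g. "black things"

def logical_product(a, b):
    """Entities that are in both a and b."""
    return a & b

def logical_sum(a, b):
    """Entities that are in a or b or both."""
    return a | b

def complement(a, universe_class):
    """Entities of the universe class that are not in a."""
    return universe_class - a

def is_empty(a):
    """A class with no members is said to be empty."""
    return len(a) == 0

print(sorted(logical_product(A, B)))
print(sorted(logical_sum(A, B)))
print(sorted(complement(A, universe)))
print(is_empty(logical_product(A, complement(A, universe))))
```

The last line checks that the product of a class with its own complement is always empty, whatever universe class is chosen.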

We may now define a subject's concept of a class as the specification by 
which he decides whether or not an entity is in that class.* To put it another 


* Again for simplicity of exposition, it is assumed that a subject categorises a given 
entity with certainty as either being or not being in a given class. If, however, he categorises 
an entity as possibly being in a given class, the argument of this paper still stands, but the 
logical apparatus required is the modal calculus noted by Lewis and Langford (1932, p. 501) 
as being equivalent to that discussed here. 
way, a subject’s concept of a class consists in his knowledge of its distinguishing 
attributes. Gestalt psychology shows that we can form a single unitary con- 
cept of an entity or class of entities with any number of attributes. However, 
the subject, by selectively attending to any combination of attributes, forms as 
many concepts as there are combinations to consider. For example, if he sees a 
big white swan, he may apprehend the object either under the concepts ‘ swan,’ 
‘ big (thing),’ ‘ white (thing),’ ' big white (thing),' ‘ big swan,’ ‘ white swan ’ or 
‘big white swan.’ Furthermore, the subject might restructure the concepts 
‘swan’ and ‘black’ to form a concept of their product, which would be 
manifested by some such remark as: “ Are there black swans? " He might also 
restructure the concepts ‘swan’ and ‘raven’ to form a concept of their sum, 
for which he would probably know the name ' bird.’ By selective attention to 
aspects of the concept of a sum or of a product, the subject could again form the 
component concepts ‘ swan,’ ‘ black,’ ‘ big,’ etc. 


Now, making a deduction reduces to solving a problem of the type: given 
that certain classes are or are not empty, what other classes are or are not 
empty ? At the worst, all the classes defined by the attributes of the given classes 
must be considered. The maximum number of classes that might need to be 
considered is easily calculated. Suppose the given classes are distinguished by 
a number, N, of different attributes (or sets of attributes that occur together), 
then each relevant, distinct class is defined by one of the $2^N$ possible combinations 
of the presence or absence of those attributes. For example, the classes 
given by the premisses of a syllogism vary in three attributes, so finding a conclusion 
might involve considering up to $2^3 = 8$ classes. In practice, many problems 
are solvable without considering all the classes that might be relevant. 
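
The count of classes can be checked with a minimal sketch that enumerates every presence/absence combination of N attributes. The attribute labels S, M and P are an assumption (the conventional names for the three terms of a syllogism), and the function name is hypothetical.

```python
from itertools import product

def attribute_classes(attributes):
    """Return every combination of presence (True) or absence (False)
    of the given attributes; each combination defines one distinct class."""
    return [
        dict(zip(attributes, combo))
        for combo in product((True, False), repeat=len(attributes))
    ]

# Three attributes, as in the syllogism example (labels are assumed).
classes = attribute_classes(["S", "M", "P"])
print(len(classes))  # 8, i.e. 2 ** 3

for c in classes:
    # Render each class as a product of attributes or their complements.
    print(" ".join(name if present else name + "'" for name, present in c.items()))
```

With no attributes at all the enumeration yields a single undivided class, which agrees with the case of one concept held at a time.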


As Bernstein (1935) has shown, the importance of N is that for each value 
it is given, it specifies a unique logic, called a finite Boolean algebra. We will, 
therefore, label each level of psycho-logic by the appropriate value of N—which 
must be a positive integer, unless it is 0, as will be explained below. 


Because the $2^N$ classes defined by N attributes are non-redundant, the 
concept of each class corresponds to a chunk. Therefore, the level at which a 
subject could reason can be inferred from his digit span, always presuming that 
this is equivalent to his memory span for concepts: the subject will be at level 
1 if the span is 2; at level 2 if it is 3 or 4; and at level 3 if it is from 5 to 8. 
As already mentioned, finite Boolean algebras would still be valuable as descrip- 
tions of the logical status of reasoning at the various psychological stages, even 
if stages and memory spans did not correspond. But if there were no corres- 
pondence, the hypothesis that a certain memory span is a pre-requisite for 
reasoning at a certain level would have to be discarded. However, a search 
of the literature, including most of that cited by Munn (1954, p. 412 ff.), has failed 
to reveal evidence of children making inferences while not having the memory 
spans required by the hypothesis. Indeed, remembering that memory capacity 
is only a necessary condition for reasoning at a certain level, and that success 
cannot be assured without the maximum capacity required at that level, it will 
be shown that there is a satisfactory agreement between the ages at which the 
various memory spans are attained and those at which the corresponding 
levels begin. 
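
The rule relating span to level can be written as a small function. This is only a sketch of the stated correspondence (a span of 2 gives level 1, 3 or 4 gives level 2, 5 to 8 gives level 3); treating a span of 1 as level 0 and rejecting spans above 8 are assumptions made here, and the function name is hypothetical.

```python
def level_from_span(span: int) -> int:
    """Return the smallest level N whose 2**N chunks cover the given span.

    Spans outside 1..8 are rejected, because the correspondence is only
    stated for the four levels 0 to 3 (an assumption of this sketch).
    """
    if not 1 <= span <= 8:
        raise ValueError("span must lie between 1 and 8 chunks")
    # (span - 1).bit_length() equals ceil(log2(span)) for span >= 1.
    return (span - 1).bit_length()

for span in range(1, 9):
    print(span, level_from_span(span))
```

The function says nothing about whether a subject will in fact reason at the level returned, since the span is proposed only as a necessary condition.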


The following sketch of levels and stages uses those of Piaget's hypothetical 
constructs that are clearly supported by empirical data ; terms for these con- 
structs are italicised. 




LEVEL 0. This demands the capacity to process $2^0 = 1$ concept at a time. 
It corresponds to the stage of sensorimotor intelligence. This is defined by Piaget 
(1937) as the period up to two years of age, during which a baby learns to form 
object concepts, that is, to treat a given object as the same object however, and 
whenever, it is perceived. 


We may suppose that some conditioning process leads the baby to categor- 
ise identically percepts similar in important respects. It is not known exactly 
how object concepts are attained. Piaget (1957, pp. 86-9) has suggested that a 
baby twiddling some object around finds that there is an invariant in the group 
of spatial displacements he has performed upon the object, and thus learns that 
the object is permanent : yet it must be pointed out that any finite number of 
twiddles may not form a group, and how invariants are extracted from percep- 
tion remains unexplained. 


For present purposes, it suffices to remark that the baby can make no 
inferences. This is due to the fact that an inference, by definition, involves 
relating at least two concepts, and at level 0 only one concept can be differen- 
tiated at a time. It may now seem paradoxical to suggest that the baby's 
object concept has N=0 distinguishing attributes. The paradox vanishes, 
however, when it is noted that if he has only one concept in mind there is no 
other concept to distinguish it from, therefore, distinguishing attributes are 
meaningless in this context. 


LEVEL 1. This level demands the capacity to retain $2^1 = 2$ concepts 
simultaneously. It corresponds to the stage of pre-operational intelligence. 
This lasts from two to seven years of age, and is defined by Piaget and Inhelder 
(1941) as the time during which the infant comes to form class concepts, that is, 
to classify objects. 


At first the infant may fail to note the distinguishing attributes specified in 
an adult’s concept of a given class, so he may make classifications which adults 
think mistaken. To overcome this he will, presumably, try to learn the dis- 
tinguishing attributes of all manner of classes ; this he seems to be doing when 
he asks a series of questions of the form: '' What's that? A cow? Why? 
What's it for? " 


A more serious source of confusion is the infant’s inability to distinguish 
between more than an object and its environment. He cannot at the same time 
note the relationship between the object and its environment, because that 
would necessitate the simultaneous retention of a third concept. But knowing 
the relative position of an object is essential to deciding whether percepts of 
similar objects encountered at different times should be classified as a number of 
objects of the same class, or as the same object. Piaget (1937, p. 374) has 
observed that infants are often in doubt on this score. 


On the other hand, the infant’s capacity to retain two concepts at once 
enables him to draw comparisons, which in turn permits him to make a rudi- 
mentary form of inference. If he is confronted with an object he can form and 
retain a concept specifying those of its attributes to which he has attended. 
If he now encounters some new object, he can compare it with the retained 
object. But as he can simultaneously retain two concepts at most, he must 
classify his concepts of the two objects as being either identically the same, or 
quite different : he cannot have a third concept of their similarities and differ- 
ences. Therefore, if the new object has at least one of the attributes of the 
retained concept, he may expect to find that it has all the other attributes as 
well. Stern (1924) calls this kind of reasoning from one concrete instance to 
another ‘transduction’: it is exemplified by his daughter who, at the age of 
3, knew a rope as something to support the family wash, so when she saw a 
rope strung along a balcony, she asked: “ What's that hanging up there? 
Something to pin clothes on ? "' 

Thinking at this stage is also characterised by the phenomenon of irreversibility. 
It is exemplified by an infant who is shown some brown beads and 
some white beads. He will correctly tell a questioner that the brown beads are 
wooden and that the white beads are wooden. But, if he is shown a box con- 
taining a larger number of brown than white beads, and he is asked whether it 
holds more brown beads or more wooden beads, he will insist that there are more 
brown beads. This would be predicted of an infant who cannot think of three 
concepts simultaneously: for, if he has already classified the white wooden 
beads as ‘ wooden ,' and the brown wooden beads as ' brown,’ or even ' brown 
wooden,' he has not the capacity to restructure his concepts to form a third 
concept ' wooden of any colour.' 

LEVEL 2. This level demands the capacity to process up to $2^2 = 4$ concepts 
simultaneously. It corresponds to the stage of concrete operational intelligence. 
This is defined by Piaget and Inhelder (1959) as the time, between 7 and 11 
years of age, when the child learns how to use concrete operations, which are 
processes of restructuring concepts, that can be illustrated by grouping tangible 
objects. 

Suppose that a child has formed concepts of two classes, A and A’. If he 
considers the attributes of A and A’ simultaneously, he thereby forms the 
concept of the sum of A and A’. This new third concept may be designated 
B. Now the child's immediate memory can accommodate the concept of a 
further class, B', which can in turn be coupled with the concept of B. This 
process, called the concrete operation of addition, may be continued indefinitely. 
It is manifested when the child couples classes of objects to form a wider class. 
Addition is also manifested when the child puts objects into a series ordered 
by some specific relation. This is now possible because the child has the capacity, 
not only to retain concepts of each pair of objects involved, but also the further 
concept of the relations between them. 


The notion that the child forms wider class concepts by combining concepts 
of subclasses two at a time is similar to that advanced by Piaget (1942 ; 1947, 
p. 43; 1949), who states that a child forms the concept of a class (e.g., animals) 
by combining the concept of a subclass (e.g., insects) with the concept of the 
complement of that subclass (non-insects). But the complement seems to be 
defined with respect to the new class (thus making ' non-insects ' mean ' animals 
that are not insects ’). This would imply that the child must already have the 
concept of the new class in order to form that concept—a circularity which 
obviously cannot be intended. 


Next suppose that the child has in his immediate memory the concepts of 
two classes C and D. Then the child can classify further concepts using, at the 
same time, the two criteria, ' having the attributes of C’ and ' having the 
attributes of D.’ This is manifested in the concrete operation of multiplication, 
that is, in sorting objects into four product classes according to whether they 
have the attributes of C and D ; of C but not of D ; of D but not of C ; or of 
neither C nor D. Multiplication is also manifested when a child finds correspondences 
between two given independent series. 
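
The sorting into four product classes can be sketched as follows. The bead data (including the glass beads), the two criteria and the function name are hypothetical illustrations, not material from the studies cited.

```python
def multiply_classes(objects, has_c, has_d):
    """Sort objects into the four product classes of two criteria C and D."""
    cells = {
        ("C", "D"): [],          # attributes of C and of D
        ("C", "not D"): [],      # of C but not of D
        ("not C", "D"): [],      # of D but not of C
        ("not C", "not D"): [],  # of neither C nor D
    }
    for obj in objects:
        key = (
            "C" if has_c(obj) else "not C",
            "D" if has_d(obj) else "not D",
        )
        cells[key].append(obj)
    return cells

# Hypothetical beads described as (colour, material).
beads = [
    ("brown", "wooden"),
    ("white", "wooden"),
    ("brown", "glass"),
    ("white", "glass"),
    ("brown", "wooden"),
]

table = multiply_classes(
    beads,
    has_c=lambda bead: bead[0] == "brown",   # criterion C: brown
    has_d=lambda bead: bead[1] == "wooden",  # criterion D: wooden
)
for key, members in table.items():
    print(key, len(members))
```

Holding the two criteria together with the object being sorted is what requires more than the two concepts available at level 1.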


The delay between the attainment of a four-chunk memory span, between 
the ages of 4 and 5, and the start of the concrete operational stage, at about 7 or 
8 years, is more apparent than real. In fact, Piaget and Inhelder (1959) give 
statistical tables which may be summed up by saying that no 4-year-olds can
perform concrete operations, but that some can be carried out systematically by 
an increasing number of subjects from the age of 5. Evidence of the early 
performance of concrete operations is also given by Welch (1940) who, incident- 
ally, shows the relation between the abstractness of concepts and their number. 
Welch distinguishes stages in the development of abstract thought by the 
hierarchic order of concepts a child can form. At 26 months, on average, his 
subjects attained first-order concepts: for example, potatoes and apples were 
both classed as foods. The second stage was reached at 4½ years, when his
subjects attained second-order concepts, potatoes being classed as vegetables,
apples as fruit, and both fruit and vegetables as foods.


Concrete operations are said to be reversible, because the child can behave
as if he had mentally cancelled an operation, either by doing the opposite (a
type of reverse operation called inversion), or by performing an operation that
compensates for the first one (reciprocity). Thus, for the first time he can argue
that if lemonade is poured from a tall, slim glass into a short, fat one, there 
will still be the same amount to drink, because the lemonade can be poured back 
again to look as it did at first (inversion), and because decreasing the lemonade's 
depth has been compensated for by giving it more breadth (reciprocity). 


At this stage, however, the child cannot understand how inversion and 
reciprocity can be combined and so effect correlation. For example, the child 
may be faced with an apparatus consisting of two communicating vessels 
containing a liquid; if a piston is pressing on the liquid in one vessel, the level
of the liquid in the other vessel will vary according to the weight of the piston
and the density of the liquid. A child, at this stage, cannot educe the correlate
needed to answer the problem: "The operation of increasing the weight of the
piston is to decreasing the weight of the piston (inversion) as increasing the
density of the liquid (reciprocity) is to . . . what?" (Inhelder and Piaget, 1955,
p. 148 ff.). The hypothesis of an immediate memory limited to holding four
concepts at a time accounts for the child's inability to educe such a correlate,
for to understand the problem he would have to retain not only the concepts of
an operation, its inversion and its reciprocity, but also concepts of the three
relationships involved.


The hypothesis also explains why it is not until the next stage that a child
can remember to keep all factors constant except one during an investigation
(Inhelder and Piaget, 1955, p. 46 ff.). 


LEVEL 3. This level demands the ability to process up to 2³=8 concepts
simultaneously. This corresponds to the stage of formal operational intelligence, 
which is reached by 11 to 15 years, when the adolescent is equipped
with a fully practical logic.


If the adolescent can now retain up to eight concepts, he can, in theory,
make any inference involving classes distinguished by up to three attributes
(or sets of attributes), such as a syllogism, so the limitations of the previous level
vanish for nearly all practical purposes.


Indeed, it will be an unusually complex problem that requires the retention
of as many as eight concepts. The conclusion to the simplest form of syllogism
(known as Barbara) can be found even with level 2 logic by a process akin to
logical addition: the first premiss says that all of class A has the attributes of
class B, so that the concept of their sum, 'A and B,' can be formed and used to
substitute for A or for B; the second premiss says that all of class B has the
attributes of class C, so that class A has the attributes of class C, which is the
required conclusion. However, if this kind of reasoning is applied to other
premisses, it may produce invalid conclusions. In fact, most fallacies in reasoning
can be accounted for on the hypothesis that the problem being considered
calls for the construction of more concepts than can be kept erect in the
immediate memory.


A level 4 logic, which would enable subjects to solve problems involving
concepts varying in four different ways, is, of course, possible. However, it is a
rare person who is intelligent enough to solve such problems mentally, because
they require the simultaneous retention of up to 2⁴=16 concepts.
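

The figures of 4, 8 and 16 concepts follow from treating each attribute as either present or absent. The sketch below is illustrative only: it assumes binary, independent attributes, and the function name `product_classes` and the attribute labels E and F are hypothetical, not taken from the paper.

```python
from itertools import product

def product_classes(attributes):
    """Enumerate every product class for a list of binary attributes.

    Each class is a tuple of (attribute, has_it) pairs, so n attributes
    yield 2**n classes.
    """
    return [
        tuple(zip(attributes, flags))
        for flags in product([True, False], repeat=len(attributes))
    ]

# Level 2: two attributes give the four classes of logical multiplication.
level2 = product_classes(["C", "D"])
for cls in level2:
    print(", ".join(f"{'' if has else 'not '}{name}" for name, has in cls))

# Number of product classes for two, three and four attributes.
counts = {n: len(product_classes(list("CDEF")[:n])) for n in (2, 3, 4)}
print(counts)
```

Each added attribute doubles the number of classes to be held in mind, which is why the step from level 3 to level 4 is so much more demanding than the step from level 2 to level 3.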


PREDICTING STUDENT PERFORMANCE IN A
UNIVERSITY DEPARTMENT OF EDUCATION


By F. W. WARBURTON, H. J. BUTCHER AND G. M. FORREST
(Department of Education, University of Manchester) 


Summary. One hundred teachers in training in the Department of Education, 
University of Manchester, were tested with measures of abilities, personality, 
interests, values and general culture, and provided a considerable range of
biographical information. The predictive power of these various measures was
examined in terms of three main criteria: final teaching mark, final theory
mark, and award of the Graduate Certificate in Education. Degree class was
the best single predictor of theory mark and of final award. A personality
questionnaire (Cattell's 16 P.F.) was the best single predictor of teaching mark.
The successful student tended, apart from academic qualifications, to have a
high level of general culture, high conscientiousness (16 P.F., factor G), to
have tender-minded attitudes to education, and to participate fairly widely in 
social activities. 


I.—INTRODUCTION. 


THE literature dealing with the prediction of success in teaching and success in
teachers' training courses is very large and very inconclusive. Some idea of its
extent may be obtained from the annotated bibliography of Domas and Tiedeman
(1950), which contains over 1,000 references to papers on 'teacher competence'
up to 1950. Its inconclusiveness is also well known. As Evans (1959) says:
"The most obvious deduction from the available evidence is that there is no
one pattern of successful teacher. Teaching is a complex process calling for
many different abilities, no one of which is by itself sufficient to ensure success.
At the same time, each of these abilities makes a small but necessary
contribution to success" (p. 33).


Among the variables which have been previously studied as possibly 
related to teaching success are: biographical data such as age, marital status 
(e.g., Ryans, 1951), intelligence and abilities (in a large number of studies), 
personality factors measured by questionnaire or by objective tests (Lamke, 
1951; Schwartz, 1950), interests (Strong, 1943), attitudes to teaching (Cook, 
Leeds and Callis, 1952) and many others. In addition to the study of theoretical 
predictors of this type, much research has been devoted to the predictive power
of interviews and kindred methods of selection (e.g., Burroughs, 1958). 


The results have, in general, been rather disappointing. Several causes
have clearly contributed to the prevailing uncertainty in this field. Firstly, as
stated by Evans in the quotation above, the criterion is a complex one. Secondly,
in some studies the criterion has been teaching mark at student level, in others
rated success as a teacher after varying periods of service. Little is known about
the relation between these two criteria, as is pointed out by Collins (1959),
since the few correlations available range from +.84 (Tudhope, 1942) to +.16
(Martin, 1944). Thirdly, the numerous variables which have been tested as
predictors have generally been tried out on quite different samples, varying in
age, nationality, sex, degree of selection and other respects, with the result that
it is hard to compare the results obtained and the relative values of the different
types of predictor. 


These last two factors are our justification for the present paper, for it
describes the administration of tests and the collection of data of all the types 
described above with one sample of students in a typical university department 
of education. Although the present research uses the teaching practice marks 
of graduate students as its principal criterion, it is also planned to follow up the 
sample after five years’ teaching experience, and thus, firstly, to provide further 
evidence about the relation between practice marks and later success, but also, 
more importantly, perhaps, to determine which of the predictors of student 
teaching success hold up, and which deteriorate, when considered against the 
later and more fundamental criterion of success in teaching as a career. 


II.—DETAILS OF THE RESEARCH.
(a) Objectives. 

The programme of research thus has three main aims: (i) to obtain as 
complete a picture as possible of a sample of graduates entering the teaching 
profession in terms of biographical and academic background, abilities, interests, 
values and personality factors; (ii) to determine the relative efficiency of these 
variables in predicting success in the course of professional training; (iii) to
follow up the subjects after five years' teaching experience and re-determine 
the relative predictive values in terms of success in teaching as a career. The 
present paper is mainly concerned with the second of these aims. 


(b) The subjects. 

Students taking the Graduate Certificate in Education course in the Depart- 
ment of Education, University of Manchester, in the session 1957-58, numbered 
133. Volunteers were called for to undergo an extensive programme of psycho- 
logical testing, and 118 attended the first period of testing, but inevitably by the 
time data on approximately 100 variables had been assembled, a number of 
gaps had appeared. The number of students on whom information on every 
variable was available had shrunk to rather less than 100. However, to facilitate 
analysis by the electronic computer, and to avoid the complication of slightly 
different numbers in the sample in different variables, the gaps were filled by 
assigning to a student who had missed a test, or about whom information was 
missing, the mean score of the whole group on that particular variable. This 
expedient was necessary in 2.3 per cent. of the entries in a 100 x 100 score matrix.
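

The gap-filling expedient described above can be sketched as follows. This is a minimal illustration and not the authors' original program: the function name `fill_with_group_mean`, the use of `None` for a missing entry and the toy figures are all assumptions.

```python
from statistics import mean

def fill_with_group_mean(scores):
    """Replace None entries in each column with that column's mean.

    `scores` is a list of rows (students); each row is a list of values
    (variables), with None marking a missing entry.
    Returns the filled matrix and the proportion of entries imputed.
    """
    n_rows, n_cols = len(scores), len(scores[0])
    filled = [row[:] for row in scores]
    imputed = 0
    for j in range(n_cols):
        # Mean of the scores actually observed on variable j.
        observed = [row[j] for row in scores if row[j] is not None]
        col_mean = mean(observed)
        for i in range(n_rows):
            if filled[i][j] is None:
                filled[i][j] = col_mean
                imputed += 1
    return filled, imputed / (n_rows * n_cols)

# Hypothetical 4 x 3 matrix (the study used 100 x 100).
toy = [
    [50, 60, None],
    [40, None, 55],
    [60, 70, 45],
    [50, 50, 50],
]
filled, share = fill_with_group_mean(toy)
print(filled, share)
```

Filling a gap with the group mean leaves the mean of that variable unchanged but slightly reduces its variance, so correlations involving heavily imputed variables will tend to be a little lower than they would otherwise be.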


(c) The variables. 

The variables included in the research comprise fifty-seven test scores, 
twenty-five biographical particulars and eighteen criteria of success on the
training course, as follows : 


TABLE 1
(1) TEST SCORES.
1—Non-verbal intelligence. Culture Free Scale 3A (Cattell).
2—Verbal intelligence. National Institute of Industrial Psychology, Group Test 33.
3—Spatial ability. National Institute of Industrial Psychology, Group Test 80A.
4—Memory. Primary Mental Abilities (Thurstone).
5—Verbal fluency. Primary Mental Abilities (Thurstone).
6—Mechanical comprehension. Mechanical Comprehension (Bennett).
7—Vocabulary. Primary Mental Abilities, Thurstone (70 items), together with 30 difficult words from various other sources.
8—General culture. 100 item test, constructed by H. J. Butcher and F. W. Warburton, Department of Education, University of Manchester. The subject is required to give the names of the authors of a list of books, covering the humanities, sciences and social sciences, and works of art and music.
9—Cyclothymia vs. Schizothymia. 16 P.F. Factor A (Cattell).
10—Intelligence. 16 P.F. Factor B (Cattell).
11—Ego strength. 16 P.F. Factor C (Cattell).
12—Dominance. 16 P.F. Factor E (Cattell).
13—Surgency. 16 P.F. Factor F (Cattell).
14—Conscientiousness. 16 P.F. Factor G (Cattell).
15—Extraversion. 16 P.F. Factor H (Cattell).
16—Sensitivity. 16 P.F. Factor I (Cattell).
17—Paranoid tendency. 16 P.F. Factor L (Cattell).
18—Unconventionality. 16 P.F. Factor M (Cattell).
19—Sophistication. 16 P.F. Factor N (Cattell).
20—Guilt proneness (Anxiety). 16 P.F. Factor O (Cattell).
21—Radicalism. 16 P.F. Factor Q1 (Cattell).
22—Self-sufficiency. 16 P.F. Factor Q2 (Cattell).
23—Self-control. 16 P.F. Factor Q3 (Cattell).
24—Id pressure (Tenseness). 16 P.F. Factor Q4 (Cattell).
25—Outdoor interests. Kuder Preference Record-Vocational, Form CH.
26—Mechanical interests. Kuder Preference Record-Vocational, Form CH.
27—Computational interests. Kuder Preference Record-Vocational, Form CH.
28—Scientific interests. Kuder Preference Record-Vocational, Form CH.
29—Managerial interests. Kuder Preference Record-Vocational, Form CH.
30—Artistic interests. Kuder Preference Record-Vocational, Form CH.
31—Literary interests. Kuder Preference Record-Vocational, Form CH.
32—Musical interests. Kuder Preference Record-Vocational, Form CH.
33—Social interests. Kuder Preference Record-Vocational, Form CH.
34—Clerical interests. Kuder Preference Record-Vocational, Form CH.
35—Theoretical values. Study of Values (Allport, Vernon and Lindzey).
36—Economic values. Study of Values (Allport, Vernon and Lindzey).
37—Aesthetic values. Study of Values (Allport, Vernon and Lindzey).
38—Social values. Study of Values (Allport, Vernon and Lindzey).
39—Political values. Study of Values (Allport, Vernon and Lindzey).
40—Religious values. Study of Values (Allport, Vernon and Lindzey).
41—Radical attitudes. Inventory of Social Attitudes (Eysenck).
42—Tenderminded attitudes. Inventory of Social Attitudes (Eysenck).
43—Radical attitudes to education. Survey of Opinions about Education (Oliver), Department of Education, University of Manchester.
44—Tenderminded attitudes to education. Survey of Opinions about Education (Oliver).
45—Naturalist attitudes to education. Survey of Opinions about Education (Oliver).
48—Accuracy in non-verbal intelligence test. Proportion answers correct in Test 1.
49—Accuracy in verbal intelligence test. Proportion answers correct in Test 2.
50—Accuracy in spatial ability test. Proportion answers correct in Test 3.
51—Accuracy in mechanical ability test. Proportion answers correct in Test 6.
52—Accuracy in vocabulary test. Proportion answers correct in Test 7.
53—Accuracy in general culture test. Proportion answers correct in Test 8.
54—Emphasis in personality test. No. 'extreme' answers in 16 P.F. test.
55—Emphasis in values test. No. extreme answers in Study of Values test.
56—Emphasis in social attitudes test. No. extreme answers in Inventory of Social Attitudes.
57—Emphasis in radical attitudes to education. No. extreme answers (radical), Survey of Opinions about Education.
58—Emphasis in tender-minded attitudes to education. No. of extreme answers (tender-minded), Survey of Opinions about Education.
59—Emphasis in naturalist attitudes to education. No. extreme answers (naturalistic), Survey of Opinions about Education.


(2) BIOGRAPHICAL PARTICULARS.
60—No. of G.C.E. 'O' or 'A' subjects. No. of subjects passed either at 'O' or 'A' level in General Certificate of Education, i.e., subjects in which at least 'O' level is reached.
61—No. of G.C.E. 'A' subjects. No. of subjects passed at 'A' level in General Certificate of Education.
62—Degree class. First class honours=5. Second class honours (upper division)=4. Second class honours (lower division)=3. Third class honours=2. Pass degree=1. No degree=0.
63—Grant. State scholar=2. Otherwise grant aided=1. No grant=0.
64—Previous teaching experience. Questionnaire. Any form of teaching, e.g., supply, Sunday School, armed forces=1. No teaching=0.
65—Degree course tutor's reference. A-E scale.
66—Interview grade. Mean grade of 2-3 individual interviewers working separately. A-E scale.
67—Social status of school. Independent School=2. Direct Grant=1. Grammar School=0.
68—Age of university. Classified as follows: Exeter=0. North Staffordshire=0. Aberystwyth=0.
69—Arts Faculty. Arts=2. Social Sciences=1. Science=0.
70—Age.
71—Male sex.
72—No. of University offices. Questionnaire. No. given.
73—No. of University societies. Questionnaire. No. given.
74—No. of interests.
75—No. of games.
76—University residence. University Hall of Residence=2. Lodgings=1. Home=0.
77—Father's education. University=5. Training College=4. Public School=3. Grammar School, High School=2. Technical School, Commercial School, Trade School=1. Elementary School=0.
78—Mother's education. As 'Father's education.'
79—Parent's occupational level. Classification of Occupations.
80—Large family size. No. of children.
81—Late position in family. 1st, 2nd, etc.
46—Religious affiliation. Questionnaire. Fundamentalist Sects=5. Nonconformists=4. Roman Catholics=3. Church of England=1. No religion=0.
47—Right political affiliation. Questionnaire. Conservative=4. Liberal=3. Labour=1. Communist=0.
99—Entry grade. A-E scale. A mark given before entrance to the Department of Education. It is based on the applicant's complete record, including examination, tutor's reference, interview grades and life record.


(3) CRITERIA OF SUCCESS.
91—Final Teaching Mark. A-E scale. Graduate Certificate in Education examination.
97—Final Theory Mark. Ditto.
98—Final Certificate Award. Ditto.
82—Tutor's teaching grade (Michaelmas term). A-E scale by tutor supervising teaching practice.
83—Tutor's teaching grade (Lent term). A-E scale by tutor supervising teaching practice.
84—School's teaching grade (Michaelmas term). A-E scale by headmaster of school in which student's school practice was carried out.
85—School's teaching grade (Lent term). Ditto.
86—Tutor's teaching check list (Michaelmas term). Check list of 43 items concerned with student's disciplinary control, lesson construction, attitude to pupils, etc.
87—Tutor's teaching check list (Lent term). Ditto.
88—Extra Tutor's teaching check list.
89—School's teaching check list (January).
90—School's teaching check list (April). Ditto.
92—Psychology of education examination. A-E scale. Graduate Certificate in Education examination.
93—Philosophy of education examination.
94—History of education examination.
95—Health education examination.
96—Essay.
100—Curriculum and Method examination.


Scores on the 100 variables were transformed into normalized T scores. 
The resulting 4,950 product moment correlations were calculated on the
Manchester University electronic computer. Several factor analyses were also 
carried out, which it is hoped to report in a later paper. 
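

The two computations mentioned here can be sketched briefly. The paper does not say which normalizing procedure was used, so the mid-rank percentile method below is an assumption, as is the function name `normalized_t_scores`; the count of correlations is simply the number of distinct pairs among 100 variables.

```python
from math import comb
from statistics import NormalDist

def normalized_t_scores(raw):
    """Convert raw scores to normalized T scores (mean 50, SD 10).

    Each score's mid-rank percentile is mapped to the matching normal
    deviate z, then rescaled as T = 50 + 10 * z. Ties share a mid-rank.
    """
    n = len(raw)
    t_scores = []
    for x in raw:
        below = sum(1 for v in raw if v < x)
        equal = sum(1 for v in raw if v == x)
        # Mid-rank percentile lies strictly between 0 and 1.
        percentile = (below + 0.5 * equal) / n
        z = NormalDist().inv_cdf(percentile)
        t_scores.append(50 + 10 * z)
    return t_scores

# Number of distinct pairs among 100 variables.
n_correlations = comb(100, 2)
print(n_correlations)  # 4950

# Hypothetical raw scores for three students.
t = normalized_t_scores([1, 2, 3])
print(t)
```

Normalizing in this way forces every variable towards the same bell-shaped distribution, which makes product moment correlations between differently scaled measures (A-E grades, counts, test scores) more comparable.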


(d) Particulars of the main criteria. 


The prediction of teaching success to be reported in the next sections will 
be in terms of three criteria, (a) final teaching mark, (b) final theory mark, 
(c) award or non-award of the certificate. All three refer to the Graduate 
Certificate in Education of the University of Manchester. In general, the 
process of arriving at these marks and at the award is believed to be fairly
typical of the procedure in other university departments of education. 


Final teaching mark is based primarily on a tutor's observations and
assessment of a student during about twelve weeks of school practice, although 
reports from the schools and from an external examiner will, in many cases, 
be bound to affect it. Final theory mark is based on the average of five exam- 
ination papers and one long essay. Award of the Graduate Certificate represents 
the decision of a group of examiners on receiving reports of final teaching mark 
and final theory mark, together with the testimony of examiners who know the 
student. 


III.—THE PREDICTION OF SUCCESS AT TEACHING.


(a) Prediction from individual measures. 


All significant correlations obtained between any of the tests or biographical 
variables and the three main criteria are shown in Table 2 below : 


TABLE 2 


SIGNIFICANT CORRELATIONS BETWEEN THE PREDICTORS AND THE MAIN CRITERIA OF 
Success 


TESTS
7—Vocabulary
8—General Culture
11—Ego strength (16 P.F.)
14—Conscientiousness (16 P.F.)
16—Sensitivity (16 P.F.)
18—Unconventionality (16 P.F.)
23—Self-control (16 P.F.)
36—Economic Values
44—Tendermindedness in education
49—Accuracy (verbal intelligence)
52—Accuracy (vocabulary)
54—Extreme responses to personality questionnaire
57—Emphasis on scale of radicalism in education

BIOGRAPHICAL PARTICULARS
60—G.C.E. 'O' or 'A'
62—Degree class
63—Grant
65—Degree course tutor's reference
66—Interview grade
72—No. of University offices held
74—No. of interests

[Correlation values not recoverable from the source.]


All the correlations in the above table are significant at the .05 level, except
those marked with an asterisk, which are at the .01 level.
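

As a guide to the size of correlation these levels imply with a sample of 100, the sketch below approximates the critical value of r by Fisher's z transformation. It assumes two-tailed tests, which the paper does not state, and the function name `critical_r` is illustrative.

```python
from math import sqrt, tanh
from statistics import NormalDist

def critical_r(n, alpha):
    """Approximate two-tailed critical value of Pearson's r.

    Uses Fisher's z transformation: z = atanh(r) is roughly normal with
    standard error 1 / sqrt(n - 3) when the true correlation is zero.
    """
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    return tanh(z_crit / sqrt(n - 3))

r_05 = critical_r(100, 0.05)
r_01 = critical_r(100, 0.01)
print(round(r_05, 3), round(r_01, 3))
```

On these assumptions a correlation needs to reach roughly .20 to be significant at the .05 level and roughly .26 at the .01 level.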


(i) Prediction from Psychological tests.


Table 2 shows that of the 171 correlations between test scores and the
three main criteria, only twenty-two (13 per cent.) are significant. This may 
seem a small number, but it must be remembered that we would not necessarily 
expect a large proportion of variables to be related to success in teaching, or 
in any other particular profession. As regards the type of test, significant 
correlations were obtained for 17 per cent. of the cognitive test scores, 18 per 
cent. of the personality measures, 0 per cent. of the interests, 10 per cent. of the
values and 15 per cent. of the attitudes assessments (22 per cent. for attitudes 
to education). There is an important reason, however, why the size of the 
correlation coefficients should not be taken quite at its face value. This is the 
familiar problem of correction for range. Although it is impossible to state the 
extent to which our sample of graduate students forms a selected group com- 
pared with the total group of successful and unsuccessful applicants for entry 
to the course, since the degree of selection and consequent narrowing of range 
will be different for every variable, there is little doubt that if the tests and 
predictors had been administered to all candidates, the majority of the correla- 
tion coefficients would have been markedly higher, and many of those that are 
at present only suggestive would become statistically significant. A striking 
example of the effects of correction for selection in a rather similar situation is 
provided by Vernon's (1950) study of Civil Service Selection procedure, in 
which all the separate, uncorrected validities are extremely low, but, when 
corrected for selection and combined, yield a considerable degree of prediction. 
On the whole, the results indicate the relative success of two single long tests, 
Cattell’s 16 P.F. personality test, and Oliver's “Survey of Opinions about 
Education," and the relative failure of the cognitive tests and the measures of 
interests. For success on the theoretical side of the course, culture and vocabu- 
lary are important, suggesting that Arts students, with their literary and verbal 
background, have an advantage in this respect. Ego-strength, conscientiousness 
and lack of extreme personality characteristics are also important, perhaps in
the preparation for examinations. There is also some indication that
tender-mindedness, radical attitudes and originality (unconventionality) are a factor in
examination success, i.e., in pleasing the examiners.
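

The effect of correcting for restriction of range can be illustrated with the standard formula for direct selection on the predictor (Thorndike's Case II). The paper itself says the degree of selection is unknown, so the ratio of standard deviations used below is purely hypothetical, as is the function name.

```python
from math import sqrt

def correct_for_range_restriction(r, sd_ratio):
    """Thorndike's Case II correction for direct selection on the predictor.

    r         -- correlation observed in the selected group
    sd_ratio  -- SD of the predictor among all applicants divided by
                 its SD in the selected group (>= 1 under selection)
    """
    return (r * sd_ratio) / sqrt(1 - r**2 + (r**2) * (sd_ratio**2))

# Hypothetical figures only: an observed r of .17 and an applicant SD
# one and a half times that of the selected students.
observed = 0.17
corrected = correct_for_range_restriction(observed, 1.5)
print(round(corrected, 3))
```

With these made-up figures a correlation that falls short of significance in the selected group rises to about .25, which shows how a merely suggestive coefficient could become a significant one in the full applicant group.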


The 'General Culture Test' emerges as a better predictor of theory mark
than any other single test. This unpublished test (Butcher and Warburton) is,
prima facie, a rather crude measure of extent of reading. It consists of 100
items, the large majority of these being the titles of books, with the respondent
being required to name the author. In the remainder of the items, the same
principle is applied to well-known paintings and musical compositions.


Turning to the variables that do not significantly predict theory mark, it is
noticeable that all six of the ability tests (variables 1-6) fall into this class,
presumably because of the high general intellectual level, and hence the narrow
range, of the sample. It is worth noting that whereas one at least of each type
of test correlates with the Final Theory Mark (and hence with the Certificate
Award), only personality scores are related significantly to practical teaching.
The finding that practical teaching is harder to predict than theoretical success
is typical of research in this area. The personality qualities related to Practical
Teaching are self-control (Q3), conscientiousness (G) and sensitivity (I). This
finding is in agreement with an American research by Erickson (Barr, et al.,
1961), who administered the 16 P.F. to a group of sixty-four teachers in Wisconsin
High Schools. Erickson found factors Q3, G and M- to be related to teaching
success, with obtained correlations which were all of the order .2 to .3. The
first two qualities are aspects of the major second order personality factor of 
stability, while sensitivity is related to tendermindedness, an attitude which is 
shown throughout this study to be positively correlated with success on this 
training course. This is not a surprising result in view of contemporary trends 
in education. 


Positive correlations between factors G and Q3 and teaching success are
reported in the 16 P.F. handbook, but the correlation with factor I (Emotional 
Sensitivity) is a new finding which may, however, be related to cultural differ- 
ences between the U.S.A. and Britain. Certainly, both subjective observation 
and empirical research (e.g., Cattell and Warburton, 1961) suggest that this 
quality of sensitivity or tendermindedness is both more highly valued by and 
more typical of the European than of the North American culture-pattern. 
One value of the 16 P.F. test in the present context appears to be that it alone, 
among the wide range of tests used, predicts teaching practice better than 
teaching theory. 


Besides the significant correlations, it is interesting to note those that, 
perhaps against expectation, are of negligible size. In particular, verbal and 
non-verbal intelligence are irrelevant to teaching success in this sample, except 
in so far as non-verbal intelligence shows a negative (but non-significant) 
correlation of -.125. Similarly, the extensive series of scores provided by the
Kuder Preference Record and the Allport-Vernon measures of values fail to 
yield any significant correlation, although one or two are suggestive of some 
relationship. 


Non-significant but suggestive correlations with teaching mark are shown 
below : 


Age ................................................ +.173
No. of G.C.E. passes ............................... +.172
No. of interests (as stated on application form for
    entry to course) ............................... +.166
Family size ........................................ +.165
Accuracy on vocabulary test ........................ +.165
Verbal fluency ..................................... +.155
Naturalism in education ............................ -.151
Paranoia or suspiciousness (16 P.F.) ............... -.142
Social interests (Kuder) ........................... +.138
Economic values (Allport-Vernon) ................... -.136

In general, these correlations, although small, make quite good psychological
sense and require little explanatory comment. The concept of 'naturalism'
in education (Butcher, 1959; Oliver and Butcher, 1962) refers approximately
to a 'child-centred,' 'progressive' grouping of attitudes. A teacher
scoring high on the scale of naturalism would be expected, for instance, to 
approve and follow many of the precepts of John Dewey. 


(ii) Prediction from biographical data.


Twenty-five per cent. of the correlations between biographical particulars 
and the three main criteria are significant as compared with 13 per cent. for 
tests. As with the psychological tests, examination success is easier to predict 
than practical teaching. Academic capacity is important, but there is also a 
relationship with breadth of interests (number of interests and university offices 
held). Two points of interest may be mentioned about these results in respect of 
theory marks. Firstly, the correlation between degree class and theory mark is 
remarkably high, particularly when one considers the necessarily coarse grading
in which degree class has to be expressed. From the point of view of predicting
theory mark (and, indeed, award of the certificate), degree class is much more
efficient than entry grade, although all available evidence is taken into account 
in arriving at the latter. 


Secondly, this is also true of tutor's reference. Both theory mark and 
award of the certificate are better predicted by the report on a student's 
performance as an undergraduate than they are by entry grade, although the 
latter includes knowledge of the former. Degree class is already known to the 
interviewer, and entry grade is based on all the information available to the 
interviewer, on the results of the interview, and on any further information 
that may have subsequently become available, such as additional references. 
Practical teaching ability is related to four variables but these essentially 
comprise academic level only. 


Putting the test scores and biographical data together, the good classroom 
teacher appears to be essentially stable, sensitive and of good academic capacity. 
Students successful in examination have the same characteristics, together with 
good verbal capacity and tenderminded and radical attitudes to educational 
problems. 


(b) Multiple Correlations.


Turning to the multiple correlations (Table 3), we see that the personality 
variables are the best set of predictors for practical teaching, the theory exam- 
ination and the final certificate award. These variables are all derived from a 
single test, Cattell’s 16 P.F., which can be administered in 40 minutes. It is, 
of course, commonplace knowledge that personality is of very considerable
importance, and possibly of paramount importance, in determining academic 
capacity. The present results suggest that it is no longer necessary to neglect 
personality test scores in predicting student performance. Interest test scores
are also more efficient predictors than the eight ability tests. The test of general 
attitudes was understandably less effective than the test of educational 
attitudes. There is a tendency, but it is not very marked, for the multiple 
correlations to be higher for the theory examination than for practical teaching. 


An interesting finding was that, on the whole, twenty-one biographical 
particulars afforded better prediction (.871) than forty-two psychological
variables (.639). It should, of course, be noted that in practice beta weights
on different samples tend to fluctuate wildly and multiple correlations fall 
considerably when the same weights are applied to later samples. 
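

A rough indication of how much such multiple correlations owe to the number of predictors can be had from the usual adjusted R-squared formula. This is offered as a sketch only: the authors do not report applying it, it estimates the population value rather than the result of cross-validation on a later sample, and the function name is illustrative. The sample size of 100 and the predictor counts are those quoted in the text.

```python
from math import sqrt

def shrunken_multiple_r(r, n, k):
    """Estimate the population multiple R from a sample multiple R.

    Applies the usual adjusted R-squared formula,
    1 - (1 - R^2) * (n - 1) / (n - k - 1), flooring the result at zero.
    """
    adjusted = 1 - (1 - r**2) * (n - 1) / (n - k - 1)
    return sqrt(max(adjusted, 0.0))

# Twenty-one biographical particulars, R = .871.
biographical = shrunken_multiple_r(0.871, 100, 21)
# Forty-two psychological variables, R = .639.
psychological = shrunken_multiple_r(0.639, 100, 42)
print(round(biographical, 3), round(psychological, 3))
```

On this estimate the biographical figure falls only to about .83, whereas the figure for forty-two psychological variables shrinks to zero, since a multiple correlation of that size is about what chance alone would be expected to produce with so many predictors and only 100 subjects.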


TABLE 3


MULTIPLE CORRELATIONS.


15 personality variables
10 interest scores
42 psychological variables
21 biographical particulars

[Correlation values not recoverable from the source.]


IV.—IMPLICATIONS FOR SELECTION PROCEDURES. 


Although considerations of space prevent a full discussion of the several 
factor analyses carried out, it will be convenient simply to list the seventeen 
main factors which emerged in the most satisfactory of the analyses and to 
discuss the most promising predictive measures under these headings. One 
factor loaded almost exclusively on criterion variables and was interpreted as 
teaching ability. The remaining sixteen were interpreted as follows : 


Academic success.*
Verbal ability.
Non-verbal ability.
Stability of personality.*
Extraversion.
Radicalism.
Tender-mindedness.*
Emphasis in expressing opinions.
Interest in arts subjects versus science subjects.
Aesthetic interests.
Participation in social activities.*
Socio-economic status.
Family size.
Assessment on entry to Department.*
Verbal fluency.
General culture.*


Six of these factors were loaded by variables which appeared to have a
promising relation with the main criteria, and these have been marked by
asterisks in the above list. They are also shown, with the more promising test
and biographical variables, in Table 4 below, which includes the combined
predictive effects of the groups of variables. The correlations would probably
be appreciably higher if the technique of multiple correlation had been used in
place of pooling with unit weights.


TABLE 4 


TESTS TO BE USED IN SUGGESTED SELECTION PROCEDURE. 


(1) ENTRY ASSESSMENT. 
Final entry grade 
Tutors’ reference (degree course) 
Total (unit weights) 

(2) ACADEMIC SUCCESS. 
University degree class 
Total (unit weights) 

(3) GENERAL CULTURE. 
General Culture (Butcher and Warburton) 
Vocabulary (Thurstone) 
Total (unit weights) 

(4) STABILITY. 
Conscientiousness (16 P.F.—G) 
Self-control (16 P.F.—Q3) 
Total (unit weights) 

(5) TENDER-MINDEDNESS. 
Sensitivity (16 P.F.—I) 
Tendermindedness in Education (Oliver) 
Total (unit weights) 

(6) SOCIAL ACTIVITIES. 
No. of expressed interests 
No. of University offices held 
Total (unit weights) 

ALL TESTS (unit weights) 



In considering these findings, it is also interesting to note the factors to 
which success in the teaching course is largely unrelated. These are verbal and 
non-verbal ability, extraversion, radicalism, emphasis in expressing opinions, 
arts versus science interests, aesthetic interests, socio-economic status, family 
size and verbal fluency. 
It would, therefore, appear best in selecting potential teachers to concentrate 
on attainment and general culture rather than ability, on stability rather 
than on extraversion or introversion, on academic rather than aesthetic interests, 
and on participation in social activities rather than on social or domestic background. 
CRITICAL NOTE 
SPECIFIC DYSLEXIA 


By D. RUSSELL DAVIS* AND ASHER CASHDAN† 
(Medical Psychology Laboratory, University of Cambridge) 


In areas of psychopathology where aetiologies are but imperfectly understood, 
and where methods of treatment are still a matter of argument, it is not surprising 
that controversy should rage over appropriate groupings of symptoms and the 
terminology most suitable for them. One such case is that of specific developmental 
dyslexia (formerly word-blindness), which under its new name has recently been 
receiving fresh attention (see Zangwill, 1960; Vernon, 1962; Miles, 1962). In our 
opinion, the controversy over the existence of this condition is often based on an 
imperfect understanding of the cardinal principles of scientific investigation and 
classification. 


A disquietingly large number of children suffer from serious backwardness in 
reading. Some catch up quickly, others never learn to read well. Some respond to 
specialist remedial teaching; others show a temporary improvement, but relapse 
as soon as the special teaching is withdrawn (see Collins, 1961); others show no 
positive response at all. Some of the sufferers are girls, but more of them are boys. 
Some are of normal or superior intelligence, far more of them are of low intelligence. 
Many of them have other educational disabilities; many also have emotional and 
family problems. A few of them have unusual patterns of handedness and eye 
dominance ; a few also are claimed by some investigators to have special perceptual 
difficulties, or difficulty in associating symbols ; such perceptual difficulties may be 
limited to the activity of reading, or may be more general. 


The above list is not exhaustive, but implicit in it is the problem that concerns 
us. This is, to what extent are any, or all, of these factors associated with each 
other? If they are, we need to know which of the associations are causal ones, and 
in which direction the causal relationship operates. For instance, does backwardness 
in reading produce emotional disturbance, or do emotional disturbances lead to a 
refusal to learn to read? Or are both true? 


If we studied and treated each case of reading disability separately, we might 
achieve fair success in curing the disability (or removing the impediment to learning). 
We would not, however, be able to draw any conclusions about the cause of the 
disability or the mode of operation of the ‘cure,’ unless we were able to plan and 
carry out the treatment experimentally, and a single case is rarely sufficient for a 
satisfactory experiment. Again, suppose we gave identical remedial treatment to two 
groups of children, each having distinct features (such as right-handed and 
left-handed children, or children of low and high tested intelligence, or simply boys and 
girls), and that one group then improved significantly more than the other. We 
might conclude that left-handed children, say, have more persistent difficulties than 
right-handed ones, or that girls recover more easily than boys. Even in such a 
situation, alternative hypotheses would be tenable, but these alternatives could 
reasonably be investigated. However, the fact that one group had done better than 
another would have justified the classification (provided always that the result was a 
stable one). In other words, it would be reasonable to classify retarded boys 
separately from retarded girls if the two groups responded differently to treatment. 
But, if the responses of the two groups were not clearly different from each other, 
no case would so far have been made out for classifying them as separate groups of 
retarded readers. 


* Now at the Department of Mental Health, Bristol University. 
† Now at the Department of Education, Manchester University. 
In fact, if any category is to be established within the group of backward 
readers, it must, to serve any useful purpose, be demonstrably different in a significant 
respect from other (or some particular group of) backward readers. The 
purpose of such a differentiation would be to make easier the task of preventing, 
curing, or forecasting the course of the condition. The study of symptoms, and their 
classification, is only useful in so far as it throws light on psychological mechanisms, 
or is somehow relevant to these three tasks. 


If a new category is proposed, we are suggesting then that it must satisfy one 
of the following criteria. It must be different in respect of aetiology, prognosis, or 
treatment indicated. If the new category differs only in respect of symptoms, then 
the differences are either trivial or point to a need for further study of the relevant 
aetiology, prognosis or treatment. 


Let us now, in this light, examine the proposed category of specific dyslexia, 
and taking each criterion in turn, ask whether the members of this proposed category 
can be differentiated from other backward readers. 


Little is known of the aetiology of any form of reading backwardness. There 
is almost certainly some relationship with low intelligence, and with social class, 
but even this is not well documented. Hereditary factors have been suggested by 
some investigators, but there is no clear-cut evidence. Hallgren (1950) found 
hereditary factors in almost all cases ; this finding is so divergent from those of other 
workers as to make us hesitant to accept any of his conclusions. Zangwill (1960) 
found a significant number of affected relations in the families of dyslexics with 
handedness problems. But his work is also inconclusive. Such studies raise, 
incidentally, the question of the value of history-taking to establish familial features 
in pathological groups. Quite simply, if the mother of a child with some pathology 
is invited to produce relatives with similar histories, she is highly inclined to do this 
(and at the same time to alleviate her own guilt feelings). The mother of a normal 
child has no such inclination. An investigation based on history-taking may be of 
value, however, if it is designed deliberately to show whether the pattern of incid- 
ence of affected relatives conforms to the one predicted from a particular hypothesis. 
Hallgren's hypothesis, for instance, was that the pathogenic gene is dominant, but 
his method of proving that the observed pattern of incidence does not depart sig- 
nificantly from the predicted pattern was hardly satisfactory. 


So much for aetiology. We can do no better on prognosis. As Vernon (1962) 
has pointed out, we certainly need to know why, in groups of children given remedial 
teaching, some improve while others do not, but in fact we do not yet know why 
this happens. Similarly, too, with methods of treatment. Miles (1962) says that 
“there is broad agreement as to the general type of remedial teaching that is 
normally called for," and he advocates, amongst other things, emphasis on phonic 
rather than on look-and-say methods, and the use of all relevant sense-modalities 
in the learning process. We would agree with his prescription, but still await 
evidence that it is indicated for any specific group of poor readers. 

We cannot, therefore, agree that there is any good evidence for Vernon's (1962) 
assertion that “there does exist a number of cases who have great difficulty in 
learning to read, which difficulty seems not to be attributable to environmental 
factors or to emotional maladjustment.” There are many reasons why psychologists 
and others might wish to spread such a belief. Some of these are related to the 
clinical management of cases, and others to the desire to support a priori theories 
about neurology and genetics. But there is not much evidence to support the 
belief. 

There are two legitimate ways in which advances may be made in this field. 
One is by clinical investigation and by testing theories and hypotheses in the 
clinical situation, or even by developing new insights there. But this has to be 
followed by the second method, which is to examine the relevance of separable 
factors (whether aetiological, symptomatic or prognostic), one at a time, in specially 
selected groups of children who are backward in reading. 
To sum up, it is legitimate to adopt for research purposes the hypothesis that 
there is a special class of backwardness in reading, which we may call specific 
developmental dyslexia, and which has one or more characteristics, and then to 
carry out experiments to test the hypothesis itself, or, more likely, a deduction from 
it. Whether such a hypothesis is a promising one at the present time is a matter of 
opinion. But it would only be legitimate to assert or imply that the existence of this 
class had been established if evidence could be provided of a quality which deserved 
general acceptance. We have indicated in this note the sort of evidence which would 
be required, and we have given our opinion that it has not yet been provided. 
SPECIFIC DYSLEXIA 


A REPLY TO DR. RUSSELL DAVIS AND MR. ASHER CASHDAN 


By M. D. VERNON 


It was made clear in my paper, “Specific Dyslexia,” that the classification of 
certain cases of reading backwardness as ‘specific dyslexics’ was presented as a 
hypothesis based upon the association together of certain symptoms and types of 
behaviour with reading backwardness with above chance frequency, and in the 
absence of certain other characteristics such as low intelligence and emotional 
maladjustment (except in so far as the latter developed after rather than before the 
onset of reading difficulties). There seems to be reliable evidence to support this 
hypothesis, notably that obtained by Dr. Ingram (to whom Dr. Davis and Mr. 
Cashdan do not refer). This hypothesis does no more than suggest tentatively 
what might be the causal factors, which are obviously very obscure. But it is, I 
suggest, scientifically interesting, and valuable in so far as it may stimulate further 
research (which may, of course, disprove it). Whether the hypothesis is useful in 
“the task of preventing, curing or forecasting the course of the condition ” is a 
medical question on which I am not qualified to adjudicate ; though I would at least 
suggest that it might be desirable to consider whether backward readers who 
might be ‘specific dyslexics’ should be treated in a somewhat different manner 
from those whose backwardness appears to be due primarily to environmental 
factors. Incidentally, I think that the term ‘specific dyslexic’ may be preferable 
to that of ‘specific dyslexia’ in so far as the latter suggests the existence of a single 
well-defined clinical entity, which seems improbable. 


I entirely agree that the clinical diagnosis of cases of reading backwardness is 
often woefully inefficient, and I have said so repeatedly. I have also urged that 
much more thorough and systematic experimental examination is necessary of the 
psychological processes characterizing backward readers, as well as of their clinical 
histories. I agree also that all too little is known about prognosis and treatment. 
But I cannot agree that “ the second legitimate way of advance ” must be to 
examine the relevance of separable factors one at a time. This method has been 
utilized again and again in the past with little useful result. What we need to 
observe now is the nature and frequency of the association between certain 
aetiological, symptomatic and prognostic factors in backward readers. When these 
have been established, it may be possible to offer further hypotheses as to causality. 


In support of what I have said here, I should like to quote from a paper in a 
recently published book, Reading Disability: Progress and Research Needs in 
Dyslexia, edited by John Money, Associate Professor of Medical Psychology and 
Paediatrics at Johns Hopkins University (Baltimore, The Johns Hopkins Press, 
1962): “It is not at all rare in psychological medicine, nor in other branches of 
medicine, that a disease should have no unique identifying sign, that uniqueness 
being in the pattern of signs that appear in contiguity. Out of context, each sign 
might be encountered in other diseases, or, in different intensities, in the healthy. 
Specific dyslexia is no exception in this respect.” 
A BRIEF COMMENT 


ON 


THE EFFECT OF SOCIO-ECONOMIC STATUS ON THE SCHOLASTIC 
ACHIEVEMENT OF SIXTH-GRADE CHILDREN, by ROBERT L. CURRY* 


By R. R. DALE 
(University College, Swansea) 


The importance of Curry’s admirable article increases the danger that his one 
faulty conclusion may be repeated by research workers without a careful appraisal 
of the data from which his finding is drawn. His first conclusion (page 48) is that 
“Socio-economic status seems to have no effect upon the scholastic achievement† 
of sixth-grade students when the students have high intellectual ability.” Unfortunately 
his data do not warrant such an assumption. Even within his high 
intellectual ability group, the same trends are apparent as within his low ability 
group, the upper socio-economic class gaining superior scores in all cases. The 
lack of statistical significance for these superior scores may be due to the smallness 
of the samples which the experimenter has selected. We cannot say that there 
seems to be ‘no effect’ when the effect merely fails to reach statistical significance. 
All we can say is that the effect may have occurred by chance ; we certainly cannot 
say in this case that it has probably occurred by chance, and to say that it has 
certainly occurred by chance—which is the equivalent of saying that in this case 
socio-economic status has no effect—is statistically unthinkable. 


There is also a second fact which deserves consideration. In work which 
produces a patterned result there is a tendency for us to ignore the importance of 
the pattern when we are considering questions of statistical significance. In Curry's 
results there are in effect eighteen differences between socio-economic groups to be 
considered; of these sixteen are in the ‘expected’ direction, and this in itself has 
its statistical importance. If we take the extreme socio-economic groups, comparing 
upper with lower, there are nine differences in mean mark to be considered, and all 
nine are in the ‘expected’ direction. The ‘t’ ratios, therefore, have a cumulative 
effect, and although when we compare the attainment levels of different social 
classes the individual ‘t’ ratios are lower in the high intellectual ability groups, we 
might well conclude that “the trend is continued but lacks statistical significance; 
such significance might be obtained in future experiments by increasing the size of 
the sample.” We need to bear in mind, also, that although we have by consent 
drawn an artificial line between ‘significant’ and ‘ non-significant,’ there is in 
actuality no such barrier, but a gradual transition. 
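
The weight carried by a pattern of directions can be illustrated with a simple sign-test calculation. The sketch below is not part of Curry's analysis or of this comment. It assumes, purely for illustration, that the differences are independent of one another and that, if socio-economic status had no effect, each difference would be equally likely to fall in either direction. Since the same children supply the scores in Reading, Arithmetic and Language, independence is doubtful, and the probabilities should be read as a rough guide only. The function name `prob_at_least` is hypothetical.

```python
from math import comb


def prob_at_least(k: int, n: int) -> float:
    """Return P(X >= k) for X ~ Binomial(n, 1/2).

    This is the chance that k or more of n differences fall in the
    'expected' direction when direction is decided by a fair coin.
    """
    if not 0 <= k <= n:
        raise ValueError("k must lie between 0 and n")
    return sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n


# Extreme groups (upper versus lower): nine of nine differences as expected.
p_nine_of_nine = prob_at_least(9, 9)

# All comparisons: sixteen of eighteen differences as expected.
p_sixteen_of_eighteen = prob_at_least(16, 18)

print(f"9 of 9:   p = {p_nine_of_nine:.4f}")
print(f"16 of 18: p = {p_sixteen_of_eighteen:.4f}")
```

Under these assumptions both probabilities are well below the conventional .05 level, which is the sense in which the pattern of directions has a statistical importance of its own, apart from the size of any single ‘t’ ratio.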


* This Journal, May, 1962, p. 46. 
† In Reading, Arithmetic and Language. 


RESEARCH NOTE 


THE RELATIVE PERSONALITY ADJUSTMENT OF SEVERELY DEAF 
AND PARTIALLY DEAF CHILDREN 


By L. R. BOWYER, A. MARSHALL AND K. WEDDELL. 


The policy, consequent on improved electronic aids, of placing partially deaf 
children in ordinary schools, has raised a question of their need for special help with 
social and emotional problems. One hypothesis is that a partially deaf person may 
have more of a personality problem than one who is severely deaf. 


A pilot study was carried out in Bristol between May and September, 1962, to 
try out ways of investigating this hypothesis and to see if there was evidence to 
warrant more time-consuming research. Three groups were studied, each with 
ten children, respectively severely deaf, partially deaf, and of normal hearing. 
The age range was restricted to 8-10 years. The groups were matched by paired 
(or triple) comparison for age, sex and intelligence. Ranges of I.Q., rather than 
single I.Q. figures were matched, corresponding to the Stanford-Binet and Wechsler 
levels of Superior, Bright, Good Average, Average, Low Average, and Dull or E.S.N. 
For this purpose a judgment was made based on recorded test scores, teachers’ 
opinions, and the results of a Goodenough Draw-a-Man test. The severely deaf 
children (Grade III deafness) were all from one Special School for the Deaf. The 
partially deaf children (Grade II deafness) were either members of a deaf unit in an 
ordinary primary school or were individuals in classes of children with normal 
hearing. 


THE CRITERION OF ADJUSTMENT. 


The criterion of adjustment devised for this study was one of ‘adequacy of 
fantasy,’ assessed from a Lowenfeld ‘World’ and a Lowenfeld Mosaic pattern, with 
consideration also of qualitative features in the Draw-a-Man test. This criterion was 
adopted for two reasons. First, the Lowenfeld techniques do not need words either 
in the instructions or in response, thus equating the communication problem for all 
groups. 


Secondly, a recent American study of deaf children (Brill, 1960), which used the 
criterion of how much each child was liked, showed a trend in favour of the severely 
deaf children, but it was not statistically significant. The authors think that this 
may have been because they omitted to separate those with deaf parents from those 
with hearing parents. In addition, however, the criterion of adjustment may have 
been too simple or one-sided. The present attempt to gain an idea of the inner 
experience of the children with the aid of projective techniques is aimed at balancing 
the picture from the other side. In a larger piece of research it would be important 
to use both ideas. In our pilot study use was made of the Vineland Social Maturity 
Scale, and we are grateful to Mr. Geoffrey Herbert for helping with several of these 
and with Goodenough and Mosaic tests. Attention was also given to teachers’ 
knowledge of the children's homes. Only the ‘adequacy of fantasy’ results will be 
reported here. 
THE SCORING SYSTEM. 


The ‘World’ and the Mosaic patterns were scored in the following way, with 
one point for each aspect: 


                    SAND TRAY                            MOSAICS 
Recognisable   Adequate Content    Freedom from     Good      Freedom from     Total 
Theme          (more than 4        adverse          Gestalt   adverse          possible 
               categories in       Clinical signs             Clinical signs   Score 
               relationship) 

     +               +                  +              +           +             5 


The validity of the World ‘Clinical signs’ (empty, unpeopled, rigid, over-fenced, 
aggressive) had been established in previous research (Buhler, 1951; Bowyer, 1956) 
and also for the Mosaics (empty centre, frame design, etc.; Lowenfeld, 1952). 


RESULTS. 


The average scores for the severely deaf, normal, and partially deaf children 
were 4.1, 3.4 and 2.85. With the Sign test of statistical significance (Siegel, Non-parametric 
Statistics, p. 68), using the hearing group as a control or norm, the severely 
deaf children in this study are above average in adjustment (N=10, x=3, p=.05). 
However, the partially deaf children are not below average to a statistically significant 
degree. 


Comparing the two deaf groups, in matched pairs equated for 
sex, age, and intelligence, gives a difference in favour of the severely deaf which is 
not statistically significant, although it appears noticeable to the naked eye, so to 
speak. A colleague has suggested that a more powerful statistic, such as analysis 
of variance, might bring out any significance that may be there. 

It is possible to obtain a significant difference between the two deaf groups 
with the Sign test, if one abandons paired comparison, regarding it as having served 
its purpose in equating the groups. With the scores arranged randomly, p=.002 
can be obtained. By pairing (a) for intelligence regardless of age or (b) for age 
regardless of intelligence, then probabilities of .06 and .05 are obtained. Perhaps 
this means that mental age is the most important consideration for adjustment in 
physical handicap, since combining age and intelligence reduces the probability of 
difference in adjustment, thus: 


Severely Deaf              Partially Deaf 
J.B.    5     +            VOW.    3½ 
S.B.    5     +            V.G.    2 
S.H.    3     −            S.R.    4 
J.A.    2½    −            C.R.    4 
O.C.    5     +            G.B.    2 
M.H.    4½    −            S.S.    5 
R.G.    4     +            M.T.    1 
R.R.    5     +            P.E.    2½ 
J.F.    3½    =            J.R.    3½ 
ALR.    3½    +            R.M.    1 


N=9, x=3, p=.254 


In the above table, the children are matched for age, sex, and intelligence. These 
various results seem to offer some encouragement to continue research on relative 
differences between the severely and the partially deaf. 
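
The probability quoted beneath the table can be checked directly from the binomial distribution on which the Sign test rests. The following is a minimal sketch, assuming the one-tailed form of the test, with N taken as the number of untied pairs and x as the number of pairs showing the less frequent sign. The function name is illustrative and does not come from the study.

```python
from math import comb


def sign_test_one_tailed(n_untied: int, x_fewer: int) -> float:
    """One-tailed Sign test probability.

    Returns the chance of observing x_fewer or fewer pairs with the less
    frequent sign among n_untied pairs, when either sign is equally likely.
    """
    if not 0 <= x_fewer <= n_untied:
        raise ValueError("x_fewer must lie between 0 and n_untied")
    return sum(comb(n_untied, i) for i in range(x_fewer + 1)) / 2 ** n_untied


# Ten matched pairs, one tie dropped, three pairs against the trend.
p_matched = sign_test_one_tailed(9, 3)
print(round(p_matched, 3))  # 0.254
```

The value agrees with the p=.254 reported for the matched pairs. A two-tailed probability, if preferred, would be twice this figure.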
THE NEXT STAGE 


Two studies are planned. One is to look at the playground behaviour in those 
schools with deaf units, to find out how many social contacts take place between 
partially deaf and hearing children. The other is to proceed with a larger sample 
of children in a school which teaches both severely deaf and partially deaf children. 
For it may be that the partially deaf child is less well adjusted only when he is 
educated as a member of a minority group in an ordinary school for hearing children, 
i.e., adjustment may be a function of the Special School with its entire staff, 
buildings, equipment and grounds devoted to helping the handicapped child. The 
alternative hypothesis would be that even in this favourable environment the 
partially deaf child, by the nature of his handicap, is a ‘marginal man’ who fits in 
neither with deaf nor hearing children. 


SUMMARIES OF RESEARCHES REPORTED IN DEGREE 
THESES 


A Survey and Discussion of the Use of Reports to Parents in Scottish and 
Other Schools. 


By F. J. R. HONIGMANN 


(Summary of thesis presented in part fulfilment of requirements for the degree of Ed.B. 
in the University of Aberdeen in 1962) 


AIM. 
To review the practices in the issue of reports to parents; to compare the 
practice in Scotland with that in selected parts of England, Wales and the United 
States; and to consider the ideal form of such a report. 


DATA. 

Thirty-five education authorities in Scotland, twenty-two in England, nine in 
Wales and six in U.S.A. (total seventy-two) were asked to submit specimens of 
reports to parents. Twenty-nine from Scotland replied, providing a reasonably 
representative sample for Scotland. Replies from Wales and U.S.A. (large cities 
only) were also representative. In England, the responsibility for issue of reports 
seems generally to be left to individual schools ; requests were sent to a sample of 
English schools but less than half sent specimen reports. 


METHOD. 

Reports from secondary schools were selected for examination. Points con- 
sidered were: frequency of issue; format; nature and quantity of information 
given (examination marks, standards of comparison, ratings, space provided for 
remarks, aspects of performance assessed). 


RESULTS. 


Scottish reports tend to be restricted to an impersonal list of examination marks 
on formal subjects with very little provision for verbal comments or assessment of 
personality development, in contrast with English, Welsh and American reports. 


CONCLUSIONS. 

Developments in educational practice have not been reflected in the form of 
reports to parents in Scotland. Alternative forms which communicate information 
more effectively and aid parent-teacher collaboration are suggested. 


An Evaluation of Readability Formulae in assessing Age-Placement of 
Text Books. 


By MARGARET TAYLOR 


(Summary of thesis presented in part fulfilment of requirements for the degree of Ed.B. 
in the University of Aberdeen in 1962) 


AIM. 

To study the validity of three American readability formulae, Lorge, Dale- 
Chall and Devereux, as applied to the age-placement of twelve British school reading 
books graded by publishers as appropriate for children aged 9 to 10 years. 


METHOD. 

The criterion used for evaluating the formulae was the combined judgement of 
forty-five experienced teachers. An order of difficulty among the twelve reading 
books was calculated from the teachers’ rankings, and the order obtained from the 
application of each of the formulae was compared with this criterion. The teachers 
also assessed other factors which contribute to the age-placement of the books but 
which are not measured by the formulae, such as format, appropriateness of content, 
exercises and illustrations. 


RESULTS. 

The teachers’ ratings showed an average intercorrelation of over +0.7 and 
their combined judgment was, therefore, accepted as a reasonably consistent 
criterion. The Lorge and the Dale-Chall formulae showed moderately high correlation 
with this criterion (+0.7 and +0.6 on rank-order), while the Devereux formula 
gave a correlation of only +0.5. All three formulae correlated highly with each 
other (inter-correlations between +0.8 and +0.9). Age placements by the formulae 
tended to be too high by comparison with teachers' opinions. There was a correlation 
of +0.3 between the teachers’ order of difficulty and their ranking of the merit 
of the books as school reading texts. Additional comments from teachers and ratings 
of other aspects were found to add relevant information beyond that given by the 
formulae. 
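
The comparison of a formula's ordering of the twelve books with the teachers' ordering can be illustrated as follows. The abstract does not say which rank-order coefficient was used; the sketch assumes Spearman's coefficient for rankings without ties. Both rankings below are invented for illustration and are not the thesis data, and the function name is hypothetical.

```python
def spearman_rho(rank_a, rank_b):
    """Spearman rank-order correlation for two untied rankings of the same items.

    Uses rho = 1 - 6 * sum(d^2) / (n * (n^2 - 1)), where d is the difference
    between the two ranks given to each item.
    """
    n = len(rank_a)
    if n != len(rank_b) or n < 2:
        raise ValueError("rankings must be of equal length, with at least two items")
    sum_d_squared = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1 - 6 * sum_d_squared / (n * (n * n - 1))


# Hypothetical orders of difficulty for twelve books (1 = easiest).
teachers_order = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12]
formula_order = [2, 1, 4, 3, 7, 5, 6, 10, 8, 12, 9, 11]

rho = spearman_rho(teachers_order, formula_order)
print(f"rank-order correlation = {rho:+.2f}")
```

With only twelve books a coefficient of this kind has a wide margin of error, so differences such as that between +0.6 and +0.7 should not be pressed far.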


CONCLUSION. 

The value of two of the readability formulae was established, but only within 
the limits of what these formulae aim to measure. Readability, however, is shown 
to be only one of several factors relevant to the appropriate age-placement of these 
school texts. 


A Gaelic Translation of the Wechsler Intelligence Scale for Children. 
By MARY MORRISON 


(Summary of thesis presented in part fulfilment of requirements for the degree of Ed.B. in 
the University of Aberdeen in 1961) 


AIM. 
To test the validity of a new Gaelic translation of WISC, with reference to the 
study of the effect of bilingualism on intelligence. 


SUBJECTS 

All Gaelic speaking children in Lewis, born in April to September 1948 
(inclusive), aged 11:6 to 12:0 at the time of testing. Of 130 such children, 128 
were tested. 


TESTS. 

In addition to WISC (Verbal and Performance Scales), two Moray House 
Verbal Reasoning Tests, Raven’s Progressive Matrices, teachers’ estimates of 
school performance, and indices of the occupational status of the homes and of the 
extent of Gaelic in the linguistic background of the homes. 


RESULTS. 
The main findings were : 


(i) The Gaelic translation of WISC agreed with the Moray House tests and Matrices 
in showing an average score of 92 or 93 in the group. 

(ii) Gaelic children were at a disadvantage on performance tests, and on strictly 
timed tests. 


(iii) The forms of WISC sub-tests recommended for use with Gaelic-speaking 
children are: 


Verbal: Comprehension, Similarities and Vocabulary (all in Gaelic), 
Arithmetic and Digit Span (both in English). 


Performance: Picture Completion, Block Design, Object Assembly, 
Coding and Mazes (all with Gaelic instructions). 


The Effects of Treatment in a Special Class in a Junior School on Retarda- 
tion and Maladjustment with Special Reference to the Concept of Self 


By DAPHNE C. IVES 


(Abstract of thesis submitted for the degree of M.Ed. in the University of Manchester, 
December, 1961) 


AIM OF THE INVESTIGATION 


This research was undertaken in order to assess the value, to eleven children of 
junior school age, of a year spent in a special class. The class was attached to a 
primary school in a Lancashire cotton town which accommodated 350 children. 
The children had been placed in the class for one or more of the following reasons: 
they were borderline cases for placement in a school for E.S.N. children; or, they 
were of low-average intelligence and severely retarded in one or more of the basic 
subjects; or, because of some emotional disturbance, they had been unable to adjust 
themselves to a normal classroom situation. 

Special emphasis was put upon changes in the self-concept of these subjects. 


METHOD 


(1) Measurements and evaluations made at the beginning and at the end 
of the period included Sander’s tests of Social Under-Evaluation (SUV) and Non-Social 
Tendency (NST),¹ Raven’s Projection Test,² the Bristol Social-Adjustment 
Guide, forms 1 and 2,³ together with tests of persistence and level of aspiration 
designed by the present writer. In addition, a series of controlled observations was 
made by the writer, using a check list during free-play periods. Items which were 
checked were subsequently grouped under four separate headings: hostility, social 
interaction, withdrawal and constructiveness. All of the data so collected were 
analysed by the Wilcoxon Matched-Pairs Signed Ranks test⁴ to establish the direction 
and significance of changes over the year. 


(2) Additional information was provided by (a) questionnaires which were 
completed at the beginning and end of the year on the basis of separate interviews 
with the children themselves and with their parents or guardians; (b) questionnaires 
administered to sixty-six children who attended the school to which the special class 
was attached concerning their attitudes to the children in this class; (c) question- 
naires completed both by the special class children and by a control group taken 
from the remainder of the school regarding their attitudes towards previous teachers ; 
and (d) four sociometric tests, covering the period. 

Although none of the data described in the previous paragraph lent itself 
readily to statistical analysis, the information was found valuable in forming a 
qualitative picture of the effects of placement in a special class of this sort. 
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(3| Educational progress was measured by standardized tests of attainment 
administered at the beginning and end of the year. In spite of what appeared to be 
marked progress made by the majority of the subjects, however, the results did not. 
lend themselves to statistical comparison owing to the fact that many of the children. 
had failed to make any score on entry. 
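
The matched-pairs signed-ranks procedure named in (1) above can be sketched as follows. This is only an illustrative outline, not the writer's own computation: the function name `wilcoxon_signed_rank` and the two lists of scores are hypothetical, and no data from the thesis are reproduced. Zero differences are dropped and tied absolute differences share the mean of their ranks, which is the usual textbook convention.

```python
def wilcoxon_signed_rank(before, after):
    """Return (T_plus, T_minus, n) for paired scores.

    Pairs with no change are dropped; tied absolute differences
    receive the mean of the ranks they occupy.
    """
    diffs = [a - b for b, a in zip(before, after) if a != b]
    # Positions of the differences, ordered by absolute size.
    ordered = sorted(range(len(diffs)), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * len(diffs)
    i = 0
    while i < len(ordered):
        j = i
        # Extend j over a run of tied absolute differences.
        while (j + 1 < len(ordered)
               and abs(diffs[ordered[j + 1]]) == abs(diffs[ordered[i]])):
            j += 1
        mean_rank = (i + j) / 2 + 1  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[ordered[k]] = mean_rank
        i = j + 1
    t_plus = sum(r for d, r in zip(diffs, ranks) if d > 0)
    t_minus = sum(r for d, r in zip(diffs, ranks) if d < 0)
    return t_plus, t_minus, len(diffs)


# Hypothetical scores for eleven children (invented for illustration).
start = [14, 12, 15, 9, 11, 13, 10, 16, 8, 12, 14]
end = [10, 12, 11, 10, 6, 7, 8, 9, 9, 9, 6]

t_plus, t_minus, n = wilcoxon_signed_rank(start, end)
print(t_plus, t_minus, n)
```

The smaller of the two rank sums is the statistic T, which is then referred to a table of critical values for the number of non-zero pairs; the sign of the larger sum gives the direction of change.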


RESULTS. 


(1) Significant changes were found to have occurred in the areas of adjustment 
measured by a number of the tests used. Social under-valuations had decreased 
(P<.01) as also had non-social tendencies (P<.025). Although the total number of 
adverse pointers listed in the Bristol Guides showed no significant change, there was 
a significant reduction in items classified U/W (unforthcomingness—withdrawal) 
and K (Knavery or psychopathic hostility to the institutions of society and its 
representatives as they impinge on the child) (P<.025 in both cases). The subjects 
displayed a more realistic approach towards future performance, as shown by a 
decrease in the final discrepancy scores calculated from the test of level of aspiration 
(P<.025). Observed incidence of hostility showed a marked decline during the 
second half of the period (P<.01) and constructiveness showed a corresponding 
increase (P<.01). 

No significant changes appeared in ‘conformity’ as shown by Raven’s Projection 
Test,² in ‘persistence,’ or in the observed frequency of interaction or incidents 
classified as ‘withdrawn’ during the controlled observations. 


(2) The qualitative results of the experiment may be briefly listed as follows: 

(a) The responses to the questionnaires given to the subjects indicated that 
they all had a poor self-view in relation to school work at the beginning of the year. 
Ten of the subjects showed an improvement in this respect by the end of the year. 

(b) The results obtained from the initial questionnaire given to a parent or 
guardian of each of eleven subjects showed that ten of the eleven adults were prepared 
to co-operate with the special class teacher and were grateful that special help was 
being provided for their children. At the end of the year, eight of the parents were 
very satisfied with the social and academic improvements made by their children. 
One (a guardian) was not interviewed again and the other two had certain reservations 
about the class, but were not wholly dissatisfied. 

(c) The responses to a questionnaire given to sixty-six children in other 
classes within the school indicated that some of them regarded the special class 
children as being inferior to themselves. Remarks to this effect were made in the 
playground which could have had an adverse effect on the self-concepts of the sub- 
jects. 

(d) When the subjects were questioned about their attitudes to teachers, the 
incidence of self-referential statements was higher among the special class children 
in both the ‘Teachers Liked’ and the ‘Teachers Disliked’ categories than among 
the ‘normal’ children. This indicates that the self-forming influence of teachers is 
greater with extremely unsuccessful children. 

(e) The sociometric inquiries showed that of three children who were socially 
isolated within this group at the beginning of the year, one had become satisfactorily 
integrated by the end of the year. 

(f) Attainment tests indicated that ten of the eleven subjects had made 
sufficient academic progress by the end of the year to enable them to feel that they 
were beginning to acquire the basic skills. 


CONCLUSIONS. 


It should be emphasised that because of the small sample and the limitations 
of the experimental situation, the following conclusions need to be regarded as 
tentative. 


1.—During the year of the experiment the subjects who had been designated 
‘withdrawn’ (criterion: ten or more U/W pointers on the Bristol Guides) became 
significantly less so. It can be tentatively concluded, therefore, that relative success, 
or a permissive classroom climate, or both, help such recessive children to make a 
more adequate social-adjustment. 


2.—There was a decrease in hostile behaviour and an increase in the number of 
constructive interactions observed in the second half of the year. This suggests 
that providing opportunities for free interactions to children who have failed to make 
a good social-adjustment is beneficial, in so far as it helps them to function more 
adequately in a group. 

3.—After a year in the special class, the subjects tended to make their predictions 
on the level of aspiration test more in accordance with reality. As the level of 
aspiration situation is directly related to feelings of success and failure (a high 
discrepancy score indicating feelings of inadequacy) it is concluded that the decrease 
in scores is indicative of an improvement in the self-concepts of the subjects which 
could be attributed to one or more of the following reasons: (a) most of the subjects 
had begun to master the basic skills of reading and arithmetic for the first time in 
their lives; (b) they were feeling more socially competent because of the opportunities 
provided for social rehabilitation; (c) significant people in their lives were 
encouraged to provide self-enhancing experiences for them. 

4.—A close liaison with the parents was considered to be an important factor in 
helping the children, both academically and socially. 


In the judgment of the writer it would be desirable for such special classes as 
this to be attached to schools which are oriented towards individual progress rather 
than towards competition in the form of examination positions and streaming by 
ability. Competition in this form leads to a certain amount of stereotyping which is 
not conducive to healthy self-evaluation on the part of the children who are un- 
successful within the system. It seems particularly important that children with 
problems of academic failure and social maladjustment should be placed with teachers 
who are temperamentally able to accept them and deal with them in a constructive 
manner. 


The Opinions of Young Workers on their own Education in a Secondary 
Modern School. 


By FRANCIS S. JACKSON 


(Abstract of thesis submitted for the degree of M.Ed. in the University of Manchester, 
April, 1962.) 


PURPOSE OF THE INQUIRY. 


The purpose of the inquiry was to obtain at first hand the adult opinions on 
education of a group of ex-pupils of a secondary modern school and to examine 
these from the point of view of (a) any relevance they might have to the organization 
and administration of such a school, and (b) their bearing upon how the school stands 
in the esteem of its own ex-pupils, the parents of tomorrow. 


THE SAMPLE. 


This consisted of 142 young men drawn from four consecutive one-year age- 
groups. They had left the same secondary modern school at the age of 15 and were, 
at the time of the inquiry, between 20 and 24 years of age. They represented 83 per 
cent. of the school’s total output of 15-year-old leavers over the four-year period in 
question, and included a wide range of occupations ranging from undergraduate to 
labourer. A simple statistical check revealed no significant differences in age and 
ability-grouping between the 83 per cent. included in the sample and the 17 per cent. 
omitted. 


THE SCHOOL. 


The school was a small secondary modern boys’ school serving a sea-side town 
and the adjoining semi-rural area. Its organization and size (two form entry, with a 
single fourth form) were fairly stable during the period under review. The curriculum 
incorporated no particularly distinctive features and no unusual teaching methods 
were in use. 


PROCEDURE, AND SCOPE OF THE INQUIRY. 


The majority of the respondents (115 out of 142) were interviewed personally 
by the investigator in their own homes, usually in the evening. Those who, for 
various reasons, could not be reached in this way completed a questionnaire which 
covered the same topics as were dealt with in each interview. These were: 


(a) Respondent’s present occupation. 

(b) Record of further education (if any). 

(c) Opinions as to the adequacy or otherwise of the school as a preparation for 
employment. 

(d) The respondent's views on the relative values of various school subjects. 

(e) Opinions and ideas on vocational guidance. 

(f) Free comment or criticism on any seemingly relevant matters. 


(g) An over-all assessment of each subject's satisfaction with his own secondary 
education on a seven-point rating scale. 


SUMMARY OF CONCLUSIONS. 


1.—The majority of the young men in this four-year sample of ex-pupils were 
favourably disposed towards their own secondary modern school. There was a slight 
tendency for those in the upper half of the ability range to be more critical 
(constructively) than their fellows. Extreme dissatisfaction was rare. 


2.—There was a distinct association between ability group and strength of 
demand for higher academic standards. A higher standard of mathematics was 
particularly requested. The requirements of further education appear to be largely 
responsible for this. 


3.—Almost 72 per cent. of the entire sample (over 94 per cent. of the ablest 
quarter) had engaged in some form of further education. This had led, in two cases, 
to the university, and in many others to diploma and certificate courses of various 
kinds. Such achievements as these, especially in ex-pupils of so small an establishment, 
need to be remembered by those who would dismiss the secondary modern 
school as an academic ‘dead end.’ 


4.—Suggested additions to the curriculum (as it then existed) were metalwork, 
a modern language, and algebra. Requests for metalwork were numerous and 
appeared to bear no relationship to level of general ability. Demands for algebra, 
on the other hand, showed a distinct association with level of ability. 


5.—Many of the ‘second best’ performers felt they had not received their full 
share of attention, particularly in the basic subjects. They felt they should have 
been pushed harder, encouraged, and allowed to try some of the work which was 
evidently considered beyond them. 


6.—Music and history came in for heavy criticism as serving no worth-while 
purpose in the modern secondary school. From the less severe critics came suggestions 
that music might be made optional and that a new approach to history might be made 
via civics and current affairs. 


7.—Mathematics, English, practical subjects, science, and geography were 
rated by all ability groups as the five most generally desirable subjects. From a 
purely vocational point of view, mathematics, woodwork, and technical drawing 
were rated higher than any other activities. 


8.—Age, as reflected in the Christmas, Easter and Midsummer leaving groups, 
appeared to be significantly associated with level of ability, the upper ability groups 
containing a disproportionately high number of the Christmas leavers. Since this 
phenomenon has been noted elsewhere, its possible effects deserve further study. 


9.—Approximately half the respondents declared themselves satisfied with 
existing vocational guidance arrangements, but some of the more thoughtful 
suggestions made pointed to the value of adequate school records and to the 
possibility of the detection of specific abilities as they emerged. 


10.—The level to which these young men have risen in their chosen occupations, 
their general record of stability in employment, and the level to which they have 
carried their education since leaving the secondary modern school suggest that for 
the majority of the 142 members of this sample their education at such a school 
has proved to be a not inadequate preparation for entry into the first phase of adult 
life. 


BOOK REVIEWS 


ANSTEY, E. (1961). Staff Reporting and Staff Development. London: Royal 
Institute of Public Administration in collaboration with George Allen and 
Unwin, Ltd., pp. 96, 12s. 6d. 


Applied psychologists who have read publications by Edgar Anstey would 
expect this book to be meaty, concise and practically helpful. They will not be dis- 
appointed. A quarter of the book is made up of appendices showing exact copies of 
staff reporting forms in current use. In the remaining seventy-odd pages Anstey is 
able to compress, in a highly readable form, balanced accounts of the objectives and 
methods of staff assessment, of confidential and open systems of reporting, the pros 
and cons of Promotion Boards and some very worthwhile advice on the design of 
report forms; he gives us the benefit of his experience in training in reporting and 
describes three kinds of staff appraisal interviews. A section which particularly 
impressed me was the five-page critical account of the forced-choice technique. 
I also liked the examples he gave of the ways in which clever reporting officers could 
subtly twist the reports so as to secure the promotion of people they liked or the 
transfer of people they disliked. Obviously, Edgar Anstey has worked with very 
intelligent people. 

On the face of it, this book is a valuable one for Establishment Officers in 
County Halls and Ministries but not one for educationists. However, those concerned 
with the grading and assessment of children, of students and of teachers 
and those who work with follow-up and validation in any field . . . all these will find 
Staff Reporting and Staff Development a very useful handbook. DENIS McMAHON. 


BEREDAY, G. Z. F., and LAUWERYS, J. A. (1962). The Year Book of Education— 
The Gifted Child. Evans Bros., in association with The University of 
London Institute of Education and Teachers’ College, Columbia University, 
New York, pp. x+541, 63s. 


Readers of this Journal will probably find this volume of the Year Book one of 
the most valuable of the series for it deals with theoretical and practical problems of 
giftedness to which society is urgently demanding answers of psychologists. The 
three major sections are headed Psychological Theories, Detection of Ability and 
Selection for Educational Purposes and Increasing the Pool of Talent. These are 
preceded by an introduction by Sir Cyril Burt which must surely be one of the most 
distinguished pieces of writing in educational psychology in recent years. Here in 
under sixty pages he broadly surveys the whole field and so orientates the reader 
for the detail which follows and provides him with a critical evaluation in advance of 
much of what appears in the rest of the book. 

Section One deals with theories as to the nature of giftedness and the educa- 
tional practices likely to stem from them. Of particular interest are Ruth Strang’s 
introduction, Guilford’s Parameters and Categories of Talent and Torrance’s 
Measurement and Development of Creative Thinking Abilities. The methods of 
selection of gifted children in a wide range of countries which are described in the 
Second Section are on the whole retrospective studies reflecting existing techniques 
which have grown up in the countries of their origin with little reference to advancing 
psychological knowledge. Most are based upon examinations and frequently include 
tests of academic aptitude. Some appear unsophisticated to British eyes. There 
are, however, some valuable accounts of researches designed to assess the efficacy of 
procedures, as, for example, in Passow’s survey and Husén’s work in Sweden into 
the age at which selection should take place. Contrary to many beliefs Husén 
concludes that early selection of the academically gifted does not benefit either those 
selected or those left behind. 

The third section concerns attempts of modern technological communities to 
meet their insatiable demands for people of high talents, and of rapidly developing 
countries to increase their available human resources as quickly as possible. The 
proposed solutions are the familiar ones of recognition followed by appropriate 
educational environments designed to encourage maximum development commensurate 
with individual genetic limitations. There is less than one might have hoped 
of discussion of the social problems satirised by Michael Young which inevitably 
arise as segregation of the most able becomes more efficient, though Anderson does 
touch on them in his chapter on Dilemmas of Talent Centred Educational Programmes. 


The levels of writing are inevitably uneven and few but a reviewer will have the 
fortitude to read the whole, but there is no doubt of the book's value for it gathers 
together a great range of thinking on the topic of giftedness and provides a very 
complete entry into the many aspects of the field. L. B. BIRCH. 


CLEUGH, M. F. (1962). Educating Older People. Tavistock Publications, 
pp. 185, 28s. 


This interesting book is an attempt to generalise the experience of the author in 
tutoring students for the diploma in teaching backward children working at the 
Institute of Education, University of London. The author makes considerable use 
of what these students have said themselves in reply to queries about their experience 
of the course. The book has all the virtues and defects which arise from the use of 
such a method: flashes of insight occasionally light up the general picture of 
banality and do-it-yourself psychoanalytic rumination which constitutes the second 
half of the book. The treatment is very uneven since Dr. Cleugh seems to work at 
two different levels, as a rational analyst of that experience which she shares with 
trained teachers (the most fruitful part of her communication) and as an amateur 
case-worker who works on the most superficial and subjective level of analysis of 
ordinary human contact. In the days of my youth I became familiar with the kind 
of content which is now being widely publicised as the latest advances of psycho- 
logical insight—except that in those days it was described, perhaps unkindly, as 
“stair-heed gossip.” I quote: 

" January 29. 

(Tutor: Are you keeping your head above water ? 

Mrs. S. : Yes, much better this term. 

Tutor: Good. Take it easy. 

Mrs. S. : It's only just above water, but it is TA (Seems steadier and more 
relaxed, in spite of brick on 17th.).” p. 129. 


The first hundred pages deal in a systematic way with the problems of educating 
older people, under the chapter-heads of Getting things done, lecturing, discussion, 
supervision of practical work, examinations. This part of the book might be regarded 
as a handbook for adult tutors: it is helpful, informed and systematic. Although it 
is on an ad hominem level with very little reference to the scientific studies which 
have been made of the adult learner, there is nothing which one would fault as 
contrary to one’s experience or conclusions about the adult. 


The second half seems to have been the result of some kind of pressure to 
produce a “proper book” in the usual Tavistock format and style rather than a 
mere brochure of “suggestions for the use of tutors.” Dr. Cleugh expresses the 
view that although the tutor is not a professional therapist he can (and ought to ?) 
serve a therapeutic purpose. It is the attempt to become self-conscious and centrally 
concerned about this function with which one would quarrel. 

However, there have been so few books published in this country dealing with 
the adult student that one must compliment the enterprise of the author and the 
publisher, and hope that the book will be widely read since it will certainly be of 
value to many different kinds of reader. Although not the most economical in terms 
of time, one occasionally is illuminated in listening even to “stair-heed gossip.” 


JOHN McLEISH. 


GORDON, I. J. (1962). Human Development. Harper, pp. 400, 46s. 


This book traces the development of the child from conception to manhood, 
preserves a good balance between the forces of heredity and environment, and 
successfully combines the physical, psychological and sociological aspects. The 
adjective which best describes the first two-thirds of the book is ‘sound,’ and this 
is by no means damning with faint praise. Two late chapters, on “Exceptional 
Development” and “The Continuing search for Identity” are very good. In 
general, both effectiveness and interest tend to increase as one reads on. 

The author endeavours to study the developing child as a whole while also 
examining the forces bearing on him at certain times in his life, and seeks to adopt a 
‘unifying theoretical position.’ The development of the self ‘is a transactional 
process between the developing organism and its changing social environment.’ 
He tries to combine an “external” approach of looking at the child's home, his 
peer group, his school and his society, with the “internal” approach of looking at 
how he perceives these various situational forces, and how he perceives his own body 
and his own growth. The theoretical account which is constructed on this basis is 
well supported by concise but informative statements of recent research. Though 
there are some references to British work, there could profitably have been more. 

Though the book is particularly sound there are a few statements which need 
careful examination—though one indeed has seen them elsewhere. Gordon 
adumbrates that “The direction of (man's) behaviour is always, in normal development, 
towards increased complexity.” The intention here is good, but one wonders 
about the word ‘always.’ When man, for example, reduces a chaos of isolated 
facts into order by classification is he not simplifying and consolidating the position, 
maybe preparing for further complexity but not increasing it for the moment ? 
Similarly, we read “the increase in complexity and organisation of the person 
continues throughout his life.” Does this statement include old age? Much more 
important, however, is the widely accepted theory of ‘equilibrium’ which Gordon 
states once again. The human organism “tries to maintain a steady state, i.e., 
it attempts to maintain itself and to balance the forces within itself.” While there 
is a central truth here the words do not describe exactly what is happening, and 
Gordon (and many other writers) contradict themselves when they maintain that the 
organism “at the same time develops towards a new steady state.” The reviewer's 
position is that the developing child must contain, if healthy, a tendency towards 
greater maturity, towards development, therefore ipso facto it cannot be in a ‘steady 
state.’ Physiologically, it is obvious that the child contains within himself the 
tendency to change, and this is also true psychologically. This often repeated 
concept of ‘equilibrium’ needs a much deeper examination and a much more 
thoughtful comment. R. R. DALE. 


HADFIELD, J. A. (1961). Childhood and Adolescence. Pelican Books, pp. 286, 5s. 


There are few books of child psychology, covering development from infancy to 
adolescence, which can be recommended confidently to parents who lack specialist 
knowledge of psychology. Dr. Hadfield has provided such a book by writing for 
“ordinary intelligent parents of ordinary intelligent children.” He has drawn on his 
experience as a psychiatrist to give many examples of children’s behaviour and 
difficulties, and to make recommendations to parents about the treatment of children. 
These are founded on his belief that the fundamental need for normal development is 
a sense of security. It follows that he gives little emphasis to sexual problems as 
primary causes of neurotic symptoms ; he considers that mental health in childhood 
depends on parental love, on discipline which combines kindness with firmness, and 
on freedom from the inhibiting effects of unnecessary restrictions or ‘over-protection.’ 
Some readers may feel that genetic constitution and social environment also 
play an important part but, in so far as Dr. Hadfield considers the latter, he finds 
that a sense of inferiority and competitiveness play minor roles in the development 
of most children. 

Following an introductory discussion of parents’ problems in bringing up children, 
chapters two to four describe the stages of maturation, phases in early development 
and the organisation of personality during childhood. In chapter five emotional 
development in adolescence is described at some length. More serious neurotic 
disorders are dealt with briefly in chapter six and, in a last chapter, several ‘golden 
rules for parenthood’ are suggested. 

Dr. Hadfield stresses the importance of allowing children to live fully through 
each phase of development, expressing each impulse before it is disciplined and 
directed. In disciplining children, he recommends the use of encouragement with a 
minimum of blame; but he considers that punishment of wrong-doing should be 
inevitable, with the proviso that the child's motives should first be understood. 

The long chapter on adolescence is likely to be most appreciated by parents ; 
for whereas writers in some popular journals deal knowledgeably with the difficulties 
of young children none deals so soundly with the problems of adolescence. The four 
stages of sexual development outlined here may be a new conception to readers 
already having some knowledge of psychology. 

This is a well-written book, in a pleasant and persuasive style with a fund of 
illustrative anecdotes; the views expressed are likely to impress most parents as 
eminently sensible. It should do much to help them, as well as students and teachers, 
to understand the development of children and to alleviate their difficulties. 


RUTH BEARD. 


HAWKES, GLENN R., and PEASE, DEMARIS (1962). Behavior and Development 
from 5 to 12. New York: Harper and Bros., pp. ix+375, 46s. 


This is rather a charming book which one would wish to have on one’s shelves 
(rather than order for the Library). It has an attractive format, clear print and 
good photographs. The content is remarkably free from distortion, considering the 
brevity of treatment of a wide variety of topics. These are presented in fifteen 
chapters, each with headed sections and sub-divisions. The integrity of approach, 
and the deft, direct style, impose their own unity, and can stem only from being 
thoroughly at home with the subject-matter, and having masterly powers of selec- 
tion. The book is at once easy to read and motivating to more advanced study, for 
which references are given at the end of chapters. The work of British writers is 
included (e.g., for Ch. 9, “growth and function of intelligence” mention is made of 
research by C. Burt, M. Dunsdon and J. Fraser-Roberts, and G. Seth). 

The authors state that their book is about children in their natural habitat— 
their homes, their neighbourhood and their schools. The reader who expects to 
have a clearly separated view of each year from 5 to 12 will not find this, except for a 
very condensed “summary of age characteristics” (agreeing with Gesell’s accounts) 
in Ch. 14 (pp. 315-330). The emphasis is more on the 5-12 ‘school-age,’ as a whole, 
except for incidental differentiation. 

The book is ideal for groups of the weekly tutorial or seminar kind, either in 
training colleges, adult evening classes, or in university introductory courses in 
child development with a social psychology orientation. At the end of most 
chapters is a list of “Study Problems.” These are thought-provoking practical 
exercises not too onerous to be carried out with enjoyment in limited time. There is 
a chapter (13) which gives details and principles for observing and recording 
children’s behaviour. Appendix A and Appendix B list, respectively, films and 
selected pamphlets, each with a descriptive sentence. The last chapter is “Adolescence, 
a look ahead” with suggested readings, which would give a study group the 
basis for a follow-up course. 

It is assumed throughout that the parents and teachers for whom the book is 
mainly written will wish to use the knowledge provided. There is also a specific 
chapter (12) on guidance, with a word of caution in the preface: “The understanding 
parent or teacher will refrain from the use of specific techniques unless he has a 
philosophical understanding of the child and the demands being made upon him.” 

A lively feature is the introduction of quoted extracts, sometimes from children 
themselves, e.g., exchanging jokes and riddles (p. 227), or an 11-year-old writing 
about teachers (p. 53): “There are some teachers who have just gotten tired of 
children because of the bothersome things they do, and do not like any of them any 
more. This kind do not help you learn very much . . .” The authors of this book are 
clearly not in this category. “Being around children,” they say (p. viii) “may 
make the hair gray, but it keeps the heart young.” It is pleasant to be able to 
recommend a book of high academic standards written with enthusiasm in a manner 
which meets the needs of the less academic and of those students and professional 
workers who would benefit from a moderately short, gay but not superficial text for 
this relatively little studied 5-12 age. L. R. BOWYER. 


MANNHEIM, KARL, and STEWART, W. A. C. (1962). An Introduction to the 
Sociology of Education. London: Routledge and Kegan Paul, pp. xvii+187, 
21s. 


This work is chiefly of historical interest. While it may be the last of Mann- 
heim, no one would claim that it was the best. Its fifteen chapters, arranged in five 
sections, are uneven in value and loosely strung together. It purports in its title and 
Introduction to constitute an introduction to the sociology of education. It does not. 
No beginner who read this book would, in fact, have been introduced to the major 
pre-occupations, problems and achievements, still less the methods, which characterize 
educational sociology in the mid-twentieth century. 


One-third of the book is devoted to psychological topics : resumés of the more 
obvious and unsophisticated theories of learning and personality development. 
There is some advance beyond associationist theory : the laws of recency, frequency 
and effect are rehearsed, but attention is given to the social context in which learning 
takes place, to the assumption of roles within social institutions. This is both too 
little and too much. It is doubtful whether the beginner would make much of a 
passing reference to Allport's theory of ‘functional autonomy’; for others it is 
really rather old hat. The chapters on social influences on personality development 
draw heavily on G. H. Mead ; they are written with only a bare hint of the contribu- 
tion which the cross-cultural studies of social anthropologists have made to this 
field of inquiry. 


The first third of the book and the last deal with matters which are more 
properly sociological. It is interesting, however, that in a chapter which usefully 
discusses teaching, training, and instruction, the temptation to explore the socio- 
logical implications of these distinctions is firmly resisted. The sociological chapters 
contain much that is platitudinous and trite. An ambitious survey of the ideals of 
the Graeco-Roman, Hebrew, Medieval, Renaissance, and post-Renaissance worlds 
illustrates the point that different ages modify the ideals (for example of ‘the 
gentleman’) which they inherit, and that ‘current attitudes have historical 
antecedents.’ 


Nowhere in the book are we introduced to the sociology of education as an 
experimental science concerned with the rigorous empirical testing of hypotheses. 
In the final chapter, it is true, a number of suggestions are made for research: a
‘survey of the existing educational institutions in this country and their
inter-relations . . .’; the factors in modern life (work and leisure) which make for the
disintegration of the personality ; the proper recruitment and role of élites if the
cultural decline of our society is to be halted ; ‘the balance between the
contributions of the home, church, school, factory, and public life towards moulding
individual character.’ There is little apparent awareness of the formidable
technical problems involved in setting up inquiries which might yield answers to these
great questions. This reviewer remains unconvinced that “the right command of
primary sources and of the existing literature on the subject already offers answers
to a great number of them.” F. MUSGROVE.


MONEY, J. (Edit.) (1962). Reading Disability. Progress and Research Needs in
Dyslexia. Baltimore: Johns Hopkins Press, pp. x+222. No price given. 


The papers published here were submitted at a symposium held in the Johns
Hopkins Medical Institutions in 1961 under the auspices of the Association for the
Aid of Crippled Children. There are thirteen symposiasts drawn from the general
fields of education, psychology and medicine. In itself, this is a reassuring sign, for
medicine and paedagogy have been too slow in coming together in joint studies of
mutual interest.

Workers in the field of severe reading disability will not be surprised that no
dramatic break-through is reported or that Howes writes “the field is still at a stage
where we need to find out exactly what dyslexia is as well as why it occurs.” It is
noteworthy, however, that in spite of the opinions given by such as Torpe and
Malmquist (who were not at the Conference), no doubt is expressed as to the
existence of the syndrome of specific developmental dyslexia. It is seen as a
dysfunction of written language and communication resembling the sequelae of some
brain injuries.

Most of the familiar topics are dealt with and brought up to date by reference to
the most recent work and there are some topics which will be unfamiliar. On the
educational side there would seem to be little new to report with regard to
phenomenology, or to recognition and treatment of the condition. As Schiffman indicates,
we still await answers to the questions : What is the aetiology ? How can we make
an early diagnosis ? What are the best remedial procedures ? It seems clear that
answers are unlikely to arise from further large scale surveys and from correlational
and factorial studies. It is pointed out by Zangwill that detailed clinical and
experimental studies are most likely to throw light upon them. The latter half of
the book is concerned with such studies though the papers are largely statements of
points of view of workers in particular fields rather than the reporting of
experiments.

Birch, in his paper on Dyslexia and the Maturation of Visual Function, insists
that dyslexia is not itself a problem capable of being investigated since it may
derive from a variety of aetiologies. This, he suggests, makes it necessary for any
worker to decide at the outset which aspect of the problem he wishes to explore ;
but, having made a decision, this does not mean that he regards other aspects as
unimportant. It is the main value of this book that workers who have made
decisions of this kind have placed their considered opinions side by side so that each
can be correlated with all others and the way forward seen a little less obscurely.

This is essential reading for all workers in this field. It follows on and
supplements M. D. Vernon’s (1957) definitive volume Backwardness in Reading, bringing
the bibliography up to 1961. L. B. BIRCH.


SCOTT, W. A., and WERTHEIMER, M. (1962). Introduction to Psychological
Research. New York: Wiley, 60s. 


The writers of this book have set out to provide help to those who are embarking 
upon psychological research for the first time. They use the word psychological 
broadly and what they have to say is relevant to all the social sciences. 

There are some to whom such a book will be an anathema. They would aver
that the only way to learn how to do research is to embark upon it and that without
such a process the knowledge acquired by the theoretical study of research
procedures is ephemeral and jejune. But this view forgets the position in which the
newcomer finds himself. It ignores the fact that psychological research techniques are
highly sophisticated, are not so easily acquired by contagion as in many other fields
and exist in considerable variety. It does not, moreover, take cognisance of the
fact that, for many new research workers, it is the existence of techniques as well as
of problems which often provides orientation and a conceptual framework for their
proposed studies.

The need for the guidance which such a book can provide would almost certainly
be granted by those who have numbers of higher degree students passing through
their hands, yet the attempts to provide it have been rather few. Of these, the
present is the best which has yet appeared. It deals in a co-ordinated manner with
the things one would expect, with the processes of planning and design in fair detail,
with the techniques of instrumentation, verbal, observational and experimental,
with sampling and with analysis, interpretation and reporting of results.
Throughout, the hand of experience is clearly revealed and the discussion is informed, up to
date and most clearly laid out. Yet, even so, if the purpose of the book was to
inform and guide the beginner it might still have been better. It tends, as do all
books in this field, towards a discussion of what research is about, as if it were a
discipline in its own right, rather than concerning itself with how to do research,
insofar as it is ever possible to describe this. This is, of course, a simplification and
generalisation which in some detail is unfair to the authors. To several of their
chapters they have added supplements which deal with the ‘how’ rather than the
‘why’ and these chapters, and not only their supplements, are among the best in
the book. Yet, too much of the 400 pages of text is taken up with discussion
which is excellent for the relatively sophisticated but which is inappropriate to the
person seeking fairly specific guidance during his novitiate. One suspects at times
that the writers wanted to present a straightforward yet informed guide book to the
neophyte but were deterred from writing the whole book in this way by the fear
of being thought unscholarly. If this were so, it was an unnecessary fear both from
the viewpoint of the subject matter and from the standing of the authors themselves.
Had they written more specifically a guide book in the way indicated, they would
surely have found space to deal, inter alia, with questionnaires, to which so many
beginners are attracted, with a treatment of non-parametric techniques in sufficient
detail to give the newcomer some orientation and, surprising omission, with factor
analysis in outline and with the types of study amenable to it. If space was short
they could well have omitted the twenty-five pages of standard tables.


But these observations notwithstanding, the book remains the best of its type
and should certainly be in all libraries, in the hands of those sophisticated enough to
skip some parts and to benefit from others and with anybody else who is not deterred
by the rather high price ticket which American books seem to attract in this country.


G. E. R. BURROUGHS.


VENESS, THELMA (1962). School Leavers. London: Methuen and Co., Ltd., 
pp. xxv+252, 25s.


This book is concerned with young people's outlook, chiefly on their own
personal future. What do they expect from life in its two main spheres, family and
work ? In particular, and this is the problem which first stimulated the inquiry,
what are their ambitions ?


The study was carried out with 1,300 boys and girls in two counties, one in the
West of England, and the other in the vicinity of London. They were in grammar,
technical and modern schools, with ages ranging from 13-plus to 17-plus, depending
on leaving age. The methods employed were, to borrow a phrase from Wilkins,
of the ‘grape shot’ variety : questionnaires, interviews, paired comparisons and so
on. The underlying assumption, shared by the reviewer, is that in certain kinds of
research a multitude of relatively coarse approaches is more likely to yield a rounded
picture than a single approach, however thorough. The key technique on which the
author relied most heavily—though without omitting frequent checks from different
sources—is the ‘retrospective autobiography.’ The writer was asked to imagine
himself at the end of his life, and review it from that perspective. This technique,
which was previously used by Stoetzel in his study of Japanese youth, produced a
rich harvest of fascinating material.

Some aspects of ideas about marriage and the family may be cited by way of
illustration. Current trends are reflected in the favourite age for marrying, between
20 and 25. Although it is not directly stated, one might infer from the quotations a
remarkable absence of the romantic notion of marriage (remarkable because of the
prevalent stress in the mass media): “This job I stayed at until I got married at
the age of 20. I had a good husband and we never quarrelled.” It is mostly rather
matter-of-fact. This also applied to the references to children ; there was little to
support the view that frustrated aspirations tend to be projected on to fantasy
children, and scant emphasis on education as a goal. Very few mentioned separation
or divorce, but a curious feature was the anticipated high mortality rate of spouses.

Attitudes about work, knowledge of jobs and job choices are extensively dealt
with, and this section of the book will be found most useful by those concerned with
the practical problems of school leavers. The author introduced an interesting
analytical categorization of job-choice orientations in terms of ‘tradition-,’ ‘inner-’
or ‘other-directed,’ which deserves further development.

The findings concerned with ambitions have aroused most widespread interest 
and led to the vociferous airing of prejudices. A common reaction to the book was 
that it reveals the youth of today as dull and unenterprising ; had the results been 
in the opposite direction, the line would no doubt have been that modern youth is 
aiming too high, with inevitable later frustration. Here it will only be noted that the 
statistical analysis of ‘ambitiousness’ items has as its outcome a scale on which
technical school boys rated highest. The problem is, of course, a difficult one, and
the author herself did have some doubts on this score. It is, perhaps, questionable
whether any single scale can satisfactorily measure ‘ambitiousness’ across the whole
range from unskilled to higher professional work. Another factor, which should be 
kept in mind with regard to all the comparisons made between schools, is that a 
majority of the grammar school pupils investigated were somewhat older than the 
rest. 

These comments are only possible because extensive and detailed tables are 
provided. Readers tempted by the lively style to skip perusal of the tables should 
resist the temptation—they contain much of the meat. Altogether the book has a 
great deal to offer to everybody concerned with young people, but perhaps most of 
all to educationists (a sample of their products) and to youth employment officers 
(a warning that what they are apt to regard as raw material is already partly 
processed). A G. JAHODA.


VYGOTSKY, L. S. (1962). Thought and Language. M.I.T. Press and Wiley.


This is an English translation of a book written getting on for thirty years ago 
which appeared again in Russian only as recently as 1956. It shows remarkable 
insights into the nature of thinking, and, in its essentially functionalist approach, is 
an appropriate work to put before the English reader today. 

After presenting his problem, which is the growth of thought as shown in the 
construction of concepts, Vygotsky then discusses Piaget’s and Stern’s views of 
language development. The comment upon Piaget is, of course, based only on his 
earlier works and the publishers include with the book a separate paper of some 
thirteen pages, setting out comments made by Piaget in 1962. Vygotsky is critical 
of the extent to which Piaget’s theory leans on the concept of maturation. However, 
Vygotsky’s own contributions form the really important parts of the book,
particularly as set out in Chapters 5, 6 and 7. In Chapter 5 he describes how conceptual
grouping by children grows from heaps having no objective basis, through com- 
plexes with no logical basis but possessing some of the qualities of concepts, finally 
to true concepts. In the next chapter, on the development of scientific concepts, he 
has much to say to the educationalists, in particular contrasting spontaneous with 
non-spontaneous concepts. The latter he exemplifies from school work in science 
and modern language learning, and shows how the two types of concept tend to 
come together in the course of learning. Finally, there is a thoughtful discussion 
of thought and word, in which the growth of speech and thinking are considered
together and in which Piaget’s views on the socialising of egocentric speech are
commented upon and checked and criticised by empirical study.

In all, the book is a ‘must’ for workers in the field of the growth of children’s
thinking and language. Those who know the publication by Piaget and Inhelder,
La genèse des structures logiques élémentaires, will also recognise how similar is the
picture presented by Vygotsky of the growth of concept formation to that set out
by the Geneva School in their publication. The book is, however, more than a
psychological treatise. Much of it offers a method of education, and so should be
read by research workers in this field. E. A. PEEL.


WISEMAN, S. (Edit.) (1961). Examinations and English Education. Manchester :
Manchester University Press, pp. xx+ 188, 21s. net. 


In 1959, the subject chosen by the School of Education in Manchester University 
for its annual public lectures was the problem of examination. There were, we are 
told, several reasons for this choice—the growing interest of secondary modern 
schools in examinations of an external type, the feeling that the time was ripe for 
taking stock of post-war changes in examinations which affect either primary or
secondary schools, and finally, the recent developments in the techniques of
examining and the need for relating them to the purposes the examination seeks to fulfil.
The present volume, edited by Professor Wiseman, is an expanded version of the 
lectures delivered on these topics, with an additional chapter from Professor Oliver 
dealing more especially with the recommendations of the Crowther Report. 

After a short introduction, the series opens with a ‘historian's view of
examinations’ by Mr. Norman Morris. This is followed by lectures on examinations in
relation to the grammar school, the modern school, and the primary school,
respectively, by Dr. Petch (Secretary to the North Universities Joint Matriculation Board),
Mr. Lucia (formerly Secretary to the Associated Examination Board), and Dr.
Wiseman. Dr. Wiseman also summarizes the methods and results of assessing the
efficiency of examinations ; and Professor Oliver discusses ‘examining general 
education,’ and then winds up the course with a paper on the problems of education 
and selection. 

All five contributors are conscious that both selection and examinations have
their limitations and their drawbacks, and therefore, call not only for some defence
if they are to be retained, but also for certain improvements if current criticisms are
to be met. They are, however, equally convinced that in recent years criticism has
been pressed too far. At bottom, as Dr. Wiseman observes, the objections to
selection, and particularly to selections by means of intelligence tests, are “social
and political in origin rather than educational and psychological.” But the familiar
arguments have been strongly supported by sociological writers, like Dr. Floud and
Dr. Halsey. In their book on Social Class and Educational Opportunity they assure
us that “measured intelligence is well known to be largely an acquired
characteristic,” and therefore unfair to children of the “unprivileged classes.” But this,
says Dr. Wiseman (quoting Dr. Warburton), is rather like declaring that “the
English Channel is well known to be wider than the Atlantic Ocean.” No one, he
adds, who has really studied recent research on the subject “can possibly deny the
very substantial genetic component in tested intelligence” ; and, as actual records
show, the introduction of such tests has undeniably helped to do fuller justice to
those who are socially or culturally handicapped. “The intelligence test must be
used if we are to get anywhere near equality of opportunity ; and we would be more
successful in reaching this objective if the test had greater weight.”

Professor Oliver defends much the same point of view. From Plato onwards, 
so he reminds us, philosophers have maintained that educational systems have a 
dual function—to train and to select; and there is the inevitable temptation to 
over-emphasize one function to the neglect or impairment of the other. Current 
discussion, too, has tended to forget that efficient selection implies and facilitates 
social mobility. Accordingly, in the interest of the community as well as of the
individual, it is to be hoped (he says) that ill-informed attacks on examinations and
selection tests will be stoutly resisted. The Crowther Report “tries to escape the
necessity of selection by reviving the supposed distinction between selection and
qualification.” The G.C.E. has come to serve both purposes; but the Report
proposes to revert to a separate examination for each. Professor Oliver holds that
the distinction is unrealistic : if the number qualified exceeds the number of places,
we must select; and we ought, so far as we can, to select the best. Hence, he
argues, the Crowther Committee “would have done better to address itself to the
improvement of the G.C.E.—the only public examination at this level which has
achieved national currency.”

How such improvements might be achieved is admirably outlined by Dr.
Wiseman. Hitherto, he believes, “educational psychologists have tended to ignore
the wider educational implications of their work, while the teacher-examiners of the
traditional boards do just the reverse” : they overlook the need and the possibility
of increasing the sheer efficiency of their techniques. In the United States,
“comparable boards have research sections engaged full-time on such problems”; and
he rightly urges that there is pressing need for similar studies over here. He sums
up the whole book by saying that “good examinations are both useful and desirable—
without them education would be much less effective than it is: let us, therefore,
make sure that the examinations we do use are efficient in the fullest sense.”


CYRIL BURT.


WILSON, HARRIET (1962). Delinquency and Child Neglect. George Allen and
Unwin, pp. 195, 25s. 


There have been numerous studies of the relationship between family
background and juvenile delinquency, but few of these have been studies in depth of the
‘problem family’ in which the children suffer some measure of neglect. Dr. Harriet
Wilson's thoughtful contribution to our knowledge in this field is, therefore, to be
welcomed and merits careful study. Basing her book on an enquiry into a group of
problem families in an industrial city, she sets out to define what is meant by the
‘problem family’ and to discover to what extent such families are the breeding
ground of delinquency.

After discussing recent trends in studies of juvenile delinquency (she surprisingly
omits reference to some important work of relevance, for example that by Stott, in
this country, and Hewitt and Jenkins in the United States) and making some
interesting comparisons between British and American culture-patterns, Dr. Wilson
goes on to describe the patterns of living of fifty-two problem families, selected as
her sample on the basis of performance-inadequacy, under the headings of solvency,
health and education. She finds that the most disabling feature of the problem
family is its social isolation, while emphasizing that these families are not, as is
commonly assumed, a homogeneous entity. Delinquency is part of the pattern of
juvenile behaviour among families exhibiting signs of child neglect, the main factors
dominating the children’s lives being poverty and weak relationships with their
parents and other adults. The chief characteristic of the juvenile delinquent from
the problem family is his complete unawareness of the moral significance of his
actions.

The book is refreshingly free from sociological jargon, and tries to avoid
unwarranted generalisations. Dr. Wilson has kept an open mind throughout her
enquiry and is, on the whole, careful about methodological considerations, though
the children have been studied less systematically than the parents. For example,
a number of conclusions about the level of the children's intelligence are drawn from
data which are incomplete and, being partly based on group tests given at the
infant school stage, hardly reliable; and the author's reason for obtaining teachers'
behavioural assessments in an unstandardized form seems inadequate.
Furthermore, can one justifiably infer the presence or absence of maladjustment from the
correlation between I.Q., attendance, and educational attainment (p. 73) ?

A slight re-arrangement of the material might have made the main findings and
conclusions stand out more clearly from the mass of detail; and some of the tables
are unnecessarily obscure. There is, however, likely to be little disagreement with
Dr. Wilson's main conclusions and with her practical proposals in the final chapter,
where she advocates, as preventive action, measures which would help the father
with a low wage and large family (so that he would no longer be financially little
better off working than if unemployed) as well as aid those families permanently
dependent on assistance from the state; realistic family planning methods; an
increase in the number of nursery schools; and the provision by local authorities
of specialized supportive help of a personal kind. MAURICE CHAZAN.


YATES, AUBREY J. (1962). Frustration and Conflict. London: Methuen, 
pp. +236. 28s. 


The author states that this book is not intended to be “an original contribution
with respect to theory,” but rather “a systematic examination of the accumulated
knowledge in the areas of frustration and conflict.” These topics have been
extensively investigated by psychologists, both clinically and, in more recent years, 
experimentally. Dr. Yates is concerned primarily with work of the latter kind, 
but a most useful bibliography of well over three-hundred items shows how widely 
he has, nevertheless, cast his net. 

The book comprises a brief introduction and five main chapters. The first is a
very thorough review and critical analysis of the work of Maier and his colleagues
on frustration and fixation. The author frequently comments on the neglect of
Maier’s findings by other psychologists and remarks: “There is no doubt that the
lasting value of his work has been obscured by sheer prejudice and a priori
convictions.” He, nevertheless, admits that Maier's results and theoretical
interpretations are still difficult to reconcile with other contemporary approaches. The second
chapter is a detailed examination of the frustration-aggression hypothesis of
Dollard, Miller, et al. and of the very considerable number of experiments which this
has provoked. The author's verdict is that work in this field has proved “extremely
profitable.” Next comes an account of studies on frustration and regression. It
includes an exhaustive critique of the famous experiment by Barker, Dembo and
Lewin. Dr. Yates notes how extraordinarily few attempts have been made to
repeat their findings, a need which is surely pressing, since the original enquiry was
conducted without a control group and the statistical treatment of data was often
unsatisfactory. The fourth chapter is on conflict. Short sections are devoted to the
work of Luria, Lewin and Cattell, but the main analysis is concentrated on the
extensive work of Miller and his colleagues. Some most ingenious experiments are
discussed in this section, including a particularly brilliant tour de force by Murray and
Berkun.

The final chapter is entitled “Frustration, Conflict, Learning Theory and
Personality.” Here Dr. Yates tries to draw the strands together, and at once the
influence of Hullian learning theory is apparent. Recent efforts to incorporate
‘frustration’ into learning theory as an intervening variable are described, as are
many experiments relating to these attempts. Eysenck’s theory of personality, 
itself of course, very considerably influenced by Hullian theory, is also favoured, 
and hypotheses are put forward linking frustration and conflict with neuroticism 
and introversion-extraversion. 

As a whole, this book is to be highly recommended. Whatever one may feel
about the author's theoretical leanings, serious attempts to test rigorously theories
about frustration and conflict are of very recent origin and should be encouraged.
Dr. Yates’ foremost contribution has been to provide a careful, objective and
wide-ranging digest of this interesting, and often exciting, current research. For the
educationalist, a useful feature is his detailed consideration of the effects of
punishment, which are shown to vary greatly under different conditions. More generally,
this book provides an essential basic reference both for those contemplating serious
research into frustration and conflict and for all advanced students of these topics.

G. W. PILKINGTON. 


THE REDUCTION OF SOCIOECONOMIC BIAS IN
INTELLIGENCE TESTING 


By R. S. MACARTHUR and W. B. ELLEY
(University of Alberta) 


SUMMARY. This study was designed to investigate the extent of
socio-economic bias in selected intelligence tests, with a view to identifying and
constructing instruments which might better estimate the intellectual potential
of pupils from deprived cultural and educational backgrounds. For such
pupils the conventional intelligence test is unsuitable as a predictor of later
educational progress if adaptive treatment reduces the effects of their
educational handicaps.

Nine promising culture-reduced tests and subtests were administered to 
a representative Edmonton sample of 271 twelve and thirteen-year-old children, 
and the results analysed in relation to measures of socio-economic status,
verbal intelligence, achievement and school marks. City-wide testing surveys 
conducted during the previous four years provided data on the same children, 
making longitudinal comparisons possible. The results supported the hypo- 
thesis that it is possible to measure a broad component of intellectual ability 
with significantly less cultural bias than is found in the conventional intelligence 
test. Raven’s Progressive Matrices proved the most useful test in the battery 
since it showed high g loading, consistent and minimal relation with socio- 
economic status, no evidence of cultural bias by items, and moderate correla- 
tions with school marks. The Cattell test of g and the Lorge-Thorndike 
Figure Analogies subtest showed similar promise, since they met most of the 
criteria of the study in shorter testing time. 


I.— INTRODUCTION 


INTELLIGENCE tests are frequently evaluated by mechanical investigation of 
their predictive validity in relation to academic criteria. Usually underlying 
such an empirical approach is the assumption that only one or two pre-arranged 
and rather standard treatments will occur between test and criterion judgments. 

Cronbach and Gleser (1957) have drawn attention to a distinction between 
fixed-treatment and adaptive-treatment situations in decision-making based 
on the results of mental tests. In the former case, treatment is specified a priori, 
and maximum utility is related to predictive validity coefficients ; individuals 
are selected or placed simply in accordance with their promise in relation to a 
predetermined criterion with predetermined treatments, and little regard for 
the qualities underlying the test is necessary. In educational prediction, the 
intervening treatments are usually limited by a relatively fixed curriculum and 
relatively standard teaching techniques. 


In adaptive teaching decisions, however, atypical individuals may be 
assigned quite different treatments according to their present status and 
indicated potential. In an educational setting, immigrant children may be
given supplementary English coaching, children from deprived backgrounds 
may benefit from remedial reading, and sensory-handicapped children may be 
offered special aids. Educational philosophies which emphasize the develop- 
ment of potential to its maximum require serious consideration of adaptive 
rather than fixed treatments, and to aid in such adaptive treatment, the use of 
mental tests which can provide estimates of potential ability, divorced as much
as possible from inequalities in cultural background. The conventional verbal 
test of intelligence, designed as it is for subjects raised within a specific cultural 
range, has limited suitability for identifying potentially gifted children from 
deprived or foreign backgrounds, or with restricted formal education. The 
writers of this article suggest that various non-verbal tests for use in adaptive 
treatment settings have received insufficient research attention. 


It is further suggested that the choosing of a test to help identify potential 
for attainment of distant and general criteria, assuming the possibility of
intervening treatments appropriately adapted to realize that potential, can be 
facilitated through the ‘ construct validity ’ approach advocated by Cronbach 
and Meehl (1955). The evaluation of tests of educability should place more 
emphasis than is customary on theoretical analysis of their relation to the 
concept of intelligence as developed by experience, with proportionately less 
emphasis on their ‘ cold-blooded ’ predictive validity under limited fixed treat- 
ments. If potentially suitable tests can be examined in the light of hypotheses 
generated from an adequate theory of intelligence, some progress may be 
anticipated in the development of culture-reduced tests suitable for classifying 
and selecting children from diverse cultural backgrounds. 


It was the purpose of the present investigation to identify tests or subtests 
which, for a sample of 12 and 13-year-old pupils from a variety of socioeconomic 
backgrounds in Edmonton, Canada, could on the one hand, be regarded as 
valid measures of intellectual ability, while on the other hand, could demonstrate 
a minimum of socioeconomic bias. Parallel studies with children from different 
ethnic and language backgrounds are now in progress (MacArthur, 1962). 


II.—PREVIOUS RESEARCH. 


Binet himself realized that his measuring scales were sensitive to ‘ scholastic 
and extra-scholastic acquisitions’ as well as ‘intelligence pure and simple’ 
(1916). Further analyses by Stern (1914) and Burt (1921) showed significant 
social class differences, particularly in the more verbal items of the Binet scales, 
while Gordon (1923), Wheeler (1942), Baldwin, et al. (1930) and others have 
shown that the conventional intelligence test was unsuitable for children without 
formal education, from isolated communities or from rural districts in general. 

Amongst the causal interpretations made of social class differences in 
measured intelligence major emphasis has usually been placed on hereditary and 
environmental influences. While not denying these, it is considered that the 
importance of test bias requires further study. Eells (1951) has produced strong 
evidence to support the contention that bias in intelligence test items is common, 
and that it tends to favour upper-middle class children. From his Chicago 
surveys, he concluded that verbal tests and items show significantly more 
variation in discriminating between social classes than do non-verbal. 

Havighurst, et al. (1944, 1947) also demonstrated that the extent of differentiation 
amongst social classes depends on the trait measured, the greatest 
discrepancies appearing in verbal and numerical tests, the smallest in spatial, 
mechanical and other non-verbal tests. 

These research studies are, unfortunately, open to the criticism made by 
Lorge (1952) that analysis in terms of social levels reveals nothing about the 
ability of tests to differentiate between intellectual levels. Both dimensions are 
important in test evaluation. Moreover, Eells used only one non-verbal test 
whose reliability was suspect (Tyler, 1953), while Havighurst's studies were 
based on relatively small samples. 

‘ Culture-fair ’ tests such as those of Cattell (1940, 1958) and the Davis-Eells 
Games (1953) have not adequately fulfilled their designers’ intentions according 
to studies described by Anastasi and Cordova (1953), Drake (1955), Smith (1956), 
Stilwell (1958) and Love and Beach (1957), although the evidence has sometimes 
been more favourable (Marquant and Bailey 1955, Cattell 1958, Fowler 1955). 
Further evaluation of such tests using a construct validity approach in a variety 
of settings seems necessary in order to assess the usefulness of such measures. 

Since the present investigators are seeking to identify tests suitable for a 
variety of cultural and educational backgrounds, it was considered that Cattell’s 
approach to the problem of reducing cultural differentials by holding them to a 
minimum would be more suitable for this study than the Davis-Eells approach 
of holding them constant. On such an assumption, this investigation was 
designed to study tests similar to Cattell's in an attempt to identify test items 
which tap fundamental processes of intellectual ability while holding environ- 
mental differences to a minimum. 

No illusions are held about the possibility of eliminating cultural influences 
from a pencil-and-paper test. Every culture stimulates the development of 
certain abilities at the expense of others, and the idea of an intelligence test 
being culture-free in the sense that it measures the products of heredity quite 
apart from the impact of cultural experience cannot be taken seriously. The 
present study aims rather to identify and evaluate tests which reduce the 
dependence on acquired knowledge in order to make possible the improvement 
of mental testing for children whose backgrounds do not fall within the required 
range assumed by the conventional verbal test. 


III.—THE CONCEPT OF INTELLIGENCE. 


Over the past several decades the concept of intelligence as general innate 
ability has been subjected to a barrage of criticisms. Factor analysts such as 
Thurstone (1938) and Guilford (1956) have disparaged the idea of a general 
factor; environmentalists such as Halsey (1959) and the Iowa investigators 
(Stoddard and Wellman, 1940) have questioned the notion that intelligence is 
predetermined or fixed ; developmental research pioneered by Piaget (1947) 
suggests that intelligence is intimately bound up with the continuous interaction 
between organism and environment, while the possibility of critical periods and 
irreversible influences in mental development is now seriously held (Hebb, 1958). 

One beneficial effect of these criticisms has been to draw attention to the 
distinctions drawn by Hebb (1949) and Vernon (1955) amongst Intelligences 
A, B and C. Nevertheless, behavioral definitions of intelligence are as contro- 
versial as ever, and the operational approach of defining intelligence in statistical 
terms appears the more profitable course of action. Such a position was adopted 
in forming the conceptual framework of this study. 

The hierarchical conception of intelligence described by Burt (1949) and 
Vernon (1950) is regarded by the present authors as the most useful model for 
assessing intelligence in diverse cultural settings. The general factor is a 
crucial construct for assessing the extent to which culture-reduced tests are 
sampling the common denominator of abilities required for success in a variety 
of mental tests. It looms so large in psychometric surveys of heterogeneous 
children that it is difficult to ignore, while the advantage of distributing its 
variance over a number of lower-level factors has yet to be demonstrated even 
with culturally homogeneous children. In predicting to distant and general 
criteria, when adaptive treatments are possible, ‘ pure ’ factor tests are seldom 
superior to general intelligence tests, while the amount of time and money 
consumed on the former is an obvious administrative disadvantage. 

The question of the innateness of the general factor can be circumvented 
by insisting only on its comparative stability in relation to lower-order factors. 
Evidence to support such stability comes from various sources. Woodrow 
(1938) demonstrated that the general factor was relatively unaffected by differ- 
ential practice in a thirty-three variable analysis, while group factors changed 
considerably. Hofstaetter’s (1954) factor analysis of the longitudinal mental 
development of children in the Berkeley Growth Study revealed a stable general 
factor increasing in size between the ages of 4 and 8, and remaining steady 
throughout pre-adolescence. Tests with high g loadings and negligible group 
factors have shown less susceptibility to cultural change (Cattell, 1958) and 
other environmental influences (Nisbet 1953, Burnett 1958). 


Theoretically, the stable g factor is compatible with the hypotheses of 
Hebb (1949) and Ferguson (1956) which suggest that abilities are developed 
primarily by positive transfer from the old learning to the new. Early superior- 
ity in processing and integrating skills should lead to increasingly broad and 
cumulative superiority of intellectual development in a variety of fields. 
Vernon (1960), too, sees support for the general construct in the developmental 
approach for " those who, early in life acquire a larger stock of perceptual 
schemata and verbal habits are better able to build up more complex and more 
flexible schemata necessary for conceptual thinking " (p. 41). So the general all- 
inclusive logical structures accumulate and proliferate, producing a broad general 
factor which, although difficult to define behaviourally, continues to account for 
most of the statistical variance in any battery of ability tests. 


In a comprehensive review of theories and research on intelligence, Hunt 
(1961) draws attention to the compatibility of apparently diverse interpretations 
of intelligence, which seem to be best explicable in hierarchical terms. 


“ A hierarchical factor structure, with Spearman's g at the apex, as the 
English factor analysts have tended to conceive of intelligence is precisely what 
would be expected from considering abilities as the accumulative effects of the 
transfer of training or learning in varied situations . . . Piaget's conception of 
accommodation corresponds to the change in function that occurs with training 
or learning, and Piaget's conception of assimilation corresponds to the notion of 
transfer of training ” (Hunt, 1961, p. 359). 


This general factor at the top of the hierarchical model is essential to the 
present study. To qualify as a measure of an all-embracing intellectual ability, 
while relatively free from the vagaries of environmental circumstances, a test 
must show a substantial loading on the general factor at the top of the hierarchy. 
Culture reduction is of little value if the test does not sample extensively that 
common denominator or ‘ transfer basis ’ of the fluid set of abilities we understand 
by intelligence. 


IV.—EXPERIMENTAL PROCEDURE. 


Nine tests and subtests were selected as promising culture-reduced measures 
of general intelligence. In May, 1960, these were given to 271 pupils constituting 
a stratified sample from the Grade VII population in the public schools of 
Edmonton, a rapidly expanding central Alberta community of nearly 300,000 
people. The variables of provincial examination averages, socioeconomic 
status, geographic location, school type and residential mobility were con- 
sidered in selecting the sample, and a comparison with a random selection from 
the same population in terms of age, sex, occupational status and Laycock I.Q., 
showed no significant differences. 

The potentially useful culture-reduced tests selected were : 
(1) Raven Standard Progressive Matrices (1956). 
(2) Cattell Test of g, “ Culture Free,” Scale 2 (1958). 
(3) Lorge-Thorndike Non-Verbal Intelligence Test, Level 4 (1957). 
(4) Lorge-Thorndike Figure Classification Subtest. 
(5) Lorge-Thorndike Number Series Subtest. 
(6) Lorge-Thorndike Figure Analogies Subtest. 
(7) Holzinger-Crowder Uni-Factor Test (1952) Figure Changes. 
(8) Holzinger-Crowder Uni-Factor Test (1952) Mixed Series. 
(9) California Test of Mental Maturity (CTMM) Non-Language, Short Form, 
Elementary (1957). 
Other variables for which data were gathered were : 
(10) California Test of Mental Maturity, Language (1957). 
(11) Holzinger-Crowder Uni-Factor Spatial Test (Boots) (1952). 
(12) Occupational Index Score. 
(13) Home Index Score. 
(14) Year-end marks on teacher-made tests of attainment. 


In May of the previous year, 1959, the same pupils had been administered 
the California Achievement Battery, Junior High (1957), whose subtests, listed 
below, were used to help identify factors in the factor analytic studies. 

(15) California Reading Vocabulary. 
(16) California Reading Comprehension. 
(17) California Reading Total. 
(18) California Arithmetic Reasoning. 
(19) California Arithmetic Fundamentals. 
(20) California Arithmetic Total. 
(21) California Language. 
(22) California Spelling. 
(23) California English Total. 


Further data used for comparative purposes were : 
(24) Laycock Mental Ability Test (1935)—raw scores obtained two years 
earlier. 
(25) Raven Coloured Progressive Matrices (1947)—administered in May, 1956. 
(26) CTMM Primary (1953)—administered in May, 1956. 
The CTMM Elementary (1957) was also analysed by subtests. 
(27) CTMM Spatial. 
(28) CTMM Logical. 
(29) CTMM Number. 
(30) CTMM Verbal. 


The occupational index was obtained by assigning each pupil's statement 
(checked in a local directory) a standard score from Blishen's (1958) Canadian 
Socio-Economic Scale, based on Dominion census data, and evenly weighted 
according to income and years of training required. The Home Index Scale 
consisted of twenty items, adapted for local conditions from a scale devised by 
Gough (1949). The purpose of these measures was not to isolate the crucial 
variables operating to accelerate mental maturity, but rather to indicate when a 
number of these stimulating correlates was present and operating directly or 
indirectly, and when their importance was reduced. 

On the basis of previous research, and the theoretical position adopted, 
the following predictions were made : 


(a) The nine culture-reduced tests will show g loadings as high as those of the 
conventional verbal tests of intelligence. 

(b) The nine culture-reduced tests will show negligible loadings on those group 
factors which appear to depend heavily on cultural acquisitions. 

(c) Socioeconomic status of father will show greater correlations with scores 
on the conventional verbal tests than it does with scores on the culture- 
reduced tests. 

(d) Over a period of four years (1956 to 1960) children from upper socioeconomic 
levels will improve their status on the CTMM more than on the 
culture-reduced Progressive Matrices. 

(e) An item analysis of the most promising culture-reduced tests will show that 
upper and lower thirds, by socioeconomic status, will succeed in similar 
ratios on each item. On the CTMM Verbal Test, discrimination indices 
will show greater variation from item to item. 

Additional evaluative criteria of relationship with school achievement, reliability 
and suitability for classroom use were also examined. 


V.—ANALYSIS OF RESULTS. 


Three socioeconomic variables were used in the analyses: occupational 
T-score, home index raw score, and a combination of the two measured with 
equal weights, yielding what was termed a socioeconomic index. Preliminary 
analysis on the two component sub-scores showed an intercorrelation of 0.61, 
which is close to Gough’s corresponding coefficient of 0.65 (1949). The pattern 
of correlations with other variables was almost identical for the two measures 
(rho = 0.87), suggesting that they sample highly similar correlates of mental 
ability. A split-half reliability coefficient of 0.77 was obtained from the Home 
Index questionnaire. 
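
The equal weighting of the two component scores can be sketched as follows. This is an illustration only: it assumes that each component was first placed on a common scale (z-scores here) before the two were averaged, which the text does not state, and the pupil scores are invented for the example. The function names are hypothetical.

```python
from statistics import mean, pstdev

def z_scores(values):
    """Standardize scores to mean 0 and standard deviation 1."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]

def pearson_r(x, y):
    """Product-moment correlation between two equally long score lists."""
    zx, zy = z_scores(x), z_scores(y)
    return mean([a * b for a, b in zip(zx, zy)])

def equal_weight_index(occupational, home):
    """Average the two standardized components with equal weights
    (standardizing first is an assumption, not stated in the source)."""
    zo, zh = z_scores(occupational), z_scores(home)
    return [(a + b) / 2 for a, b in zip(zo, zh)]

# Invented scores for four pupils, for illustration only.
occupational = [40, 50, 60, 70]
home = [8, 12, 10, 18]

index = equal_weight_index(occupational, home)
r_components = pearson_r(occupational, home)
print(index, round(r_components, 2))
```

Standardizing before averaging keeps the component with the larger raw spread from dominating the composite, which is one plausible reading of "equal weights".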

(a) and (b) The two hypotheses concerning the abilities sampled by each 
test were investigated by performing two factor analyses using Hotelling’s 
Principal Components Method. The first analysis was performed on twenty- 
one variables (Tables 1 and 2) and included many subtests. The second 
concentrated on the eleven total test scores (Tables 3 and 4). 

From the first matrix six principal components were extracted* but the 
last was discarded because it failed to reach significance level, according to 
the tests of McNemar, Burt, and Guilford and Lacey. The five significant 
factors were then rotated using criteria of positive manifold, maximization of the 
general factor and psychological meaningfulness. After four rotations a 
compelling psychological factor pattern emerged (Table 2). The factors were 
interpreted as general ability (g), a verbal educational factor (v : ed), socio- 
economic status (se), numerical ability (n) and, possibly, speed (sp). The fact 
that no spatial factor appeared in the analysis may be due to the low ceiling of 
the CTMM Spatial Subtest, or the strong speed element in the Holzinger- 
Crowder Test. The eleven total test scores were analysed similarly yielding 
three clear factors labelled g, v : ed and n. A comparison of the factor patterns 
of the tests common to both analyses showed a mean loading discrepancy of 
only .06, which suggests a high degree of similarity in the common dimensions 
obtained. The rotated factor solutions indicate that the tests most highly loaded 
with the general factor were Cattell's Test of g, the Progressive Matrices, and 
the Lorge-Thorndike Figure Analogies Subtest. The remaining six culture- 
reduced tests showed less promising g saturations, or significant loadings on 
verbal or numerical factors. 

* Communalities were estimated by calculating the squared multiple correlation of 
each variable with the n−1 remaining variables in the battery, as suggested by Wrigley 
(1959) and Kaiser (1960). Iteration was performed after replacing the communality with a 
value increased (or decreased) by one-and-a-half times the discrepancy between obtained 
and estimated values, until convergence occurred. 
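
The starting communality estimate described in the footnote, the squared multiple correlation (SMC) of each variable with the rest, can be obtained from the inverse of the correlation matrix as 1 − 1/r^ii, where r^ii is the i-th diagonal element of the inverse. The sketch below shows that first step only; it does not reproduce the iterative adjustment, and the three-variable correlation matrix is invented for illustration. The helper names are hypothetical.

```python
def invert(matrix):
    """Invert a square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(matrix)
    # Build the augmented matrix [A | I].
    aug = [[float(v) for v in row] + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [a - f * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def smc_communalities(corr):
    """Squared multiple correlation of each variable with all the others."""
    inv = invert(corr)
    return [1.0 - 1.0 / inv[i][i] for i in range(len(corr))]

# Invented correlation matrix, for illustration only.
corr = [[1.0, 0.6, 0.5],
        [0.6, 1.0, 0.4],
        [0.5, 0.4, 1.0]]

smc = smc_communalities(corr)
print([round(h, 3) for h in smc])
```

These values would replace the unities in the diagonal of the correlation matrix before components are extracted.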

TABLE 2 


ROTATED FACTOR SOLUTION AND COMMUNALITIES FOR TWENTY-ONE VARIABLES. 
(Loadings below .20 omitted) 


Test 
No.   Test Name                  I (g)  II (v:ed)  III (se)  IV (n)  V (sp)    h² 

 1    Prog. Matrices              708       —         —        —       —      542 
 2    Cattell g                   754       —         —        —      222     626 
 4    L-Th. Fig. Class.           582       —         —        —       —      355 
 5    L-Th. No. Series            547       —         —       395      —      486 
 6    L-Th. Fig. Anal.            738       —         —        —       —      553 
 7    Holz.-Crowd. Fig. Ch.       498       —         —       211     369     440 
 8    Holz.-Crowd. Series         457      214        —       453     303     554 
11    Holz.-Crowd. Spatial        397       —         —        —      396     326 
12    Occ. Status                 251       —        683       —       —      583 
13    Home Index                  247      205       669       —       —      602 
15    Reading Vocab.              335      743        —        —      264     747 
16    Reading Comp.               501      618        —        —       —      681 
18    Arith. Reasoning            460      341        —        —       —      369 
19    Arith. Fund.                450      436        —       442      —      629 
21    Language                    417      589        —        —      299     641 
22    Spelling                    201      618        —        —       —      443 
24    Laycock                     676      509        —        —       —      735 
27    CTMM Spatial                612       —         —        —       —      428 
28    CTMM Logical                660       —         —        —       —      476 
29    CTMM Number                 639      201        —       331      —      563 
30    CTMM Verbal                 458      658        —        —       —      694 

% of Common Variance            51.18    25.23      9.11     7.80    6.68   100.00 
% of Total Variance             27.94    13.78      4.97     4.26    3.65    54.60 
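
The two summary rows of Table 2 are related by simple arithmetic: each factor's share of the common variance is its share of the total variance divided by the total communality (54.60 per cent). The short check below uses only the figures printed in the table. The dictionary keys are the factor labels given in the text.

```python
# Percentage of total variance accounted for by each rotated factor (Table 2).
total_share = {"g": 27.94, "v:ed": 13.78, "se": 4.97, "n": 4.26, "sp": 3.65}

# The shares sum to the percentage of total variance that is common variance.
common_total = sum(total_share.values())

# Re-express each factor's share as a percentage of the common variance.
common_share = {name: 100 * share / common_total
                for name, share in total_share.items()}

print(round(common_total, 2))
for name, share in common_share.items():
    print(name, round(share, 2))
```

The recomputed percentages agree with the printed "% of Common Variance" row to within rounding error.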

TABLE 3 


CORRELATION MATRIX FOR ELEVEN TOTAL TEST SCORES. 
(Decimal Points Omitted.) 

[Table entries not recoverable from the source.] 

* Final estimate of the communality. 

TABLE 4 


ROTATED FACTOR SOLUTION AND COMMUNALITIES FOR ELEVEN TOTAL TEST SCORES. 
(Loadings below .20 omitted). 

[Table entries not recoverable from the source. Surviving row labels: Prog. Matrices ; 
Cattell g ; Lorge-Thorndike ; Holz.-Crowd. Fig. Ch. ; Holz.-Crowd. Series ; 
% of Common Variance ; % of Total Variance.] 


(c) To investigate the third hypothesis, the combined index of socio- 
economic status was correlated with each test, and the difference between 
coefficients tested for significance. Table 5 shows the correlations of socio- 
economic status with the nine culture-reduced tests in the left column and four 
verbal tests across the top. Significance levels of difference between coefficients 
are found in the body of the table. 


Only three of the observed differences failed to reach the .05 significance 
level, a finding which supports the hypothesis that the culture-reduced tests 
selected show less differentiation between socioeconomic levels than do the 
conventional verbal tests. The three tests found to have highest g loadings 
showed satisfactory linearity of regression and so can be considered to meet the 
third criterion of culture-reduction quite adequately. 


TABLE 5 


SIGNIFICANCE LEVELS OF DIFFERENCES IN CORRELATION COEFFICIENTS BETWEEN SOCIO- 
ECONOMIC STATUS AND SELECTED TESTS. 

[Table entries not recoverable from the source. Surviving column heads: CTMM Total ; 
Laycock Total ; CTMM Lang. Surviving row labels: Progressive Matrices ; Cattell Test of g ; 
Lorge-Th. Fig. Class. ; Lorge-Th. No. Series ; Lorge-Th. Fig. Anal. ; Lorge-Th. Total ; 
CTMM Non-Lang. ; Holz.-Crowd. Series ; Holz.-Crowd. Fig. Ch.] 

(d) The Progressive Matrices and CTMM tests were used to test the fourth 
hypothesis of change in relationship with socioeconomic status over four years. 
Since appropriate forms of the Matrices and CTMM had been administered to the 
sample in both Grades III and VII, it was possible to predict an increase in 
correlation with socioeconomic status for the more culture-loaded but not 
the culture-reduced test. The fact that the correlations between grade levels 
were similar for each test (.545 and .471, respectively) made the assumption of 
similar degree of overlap in the functions measured on both occasions appear 
reasonable, while the same coefficients were considered low enough to allow 
those socioeconomic variables associated with test scores to operate. 

Correlation coefficients with socioeconomic status were calculated for 
normalized T-scores on all four tests, and differences tested for significance.* 
Results showed an insignificant increase of 0.015 in the case of the Matrices, 
whereas the corresponding CTMM Total correlation increased from 0.215 in 
Grade III to 0.323, a difference which is significant at the .05 level. As 
predicted, the abilities sampled by the Matrices test appear less susceptible 
to the cumulative effects of those socioeconomic variables which influence the 
mental development of children between Grades III and VII. 
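
The text does not say which significance test was applied to the difference between the two correlations. One classical choice for two correlations that share a variable is Hotelling's (1940) t, and the sketch below applies it to the reported figures as an illustration only. Treating .471 as the CTMM grade-to-grade correlation and taking n = 243 from the footnote are assumptions read from the text, and the function name is hypothetical.

```python
import math

def hotelling_t(r12, r13, r23, n):
    """Hotelling's t for the difference between two dependent correlations
    r12 and r13 that share variable 1; r23 is the correlation between the
    two non-shared variables. Returns (t, degrees of freedom)."""
    det = 1 - r12**2 - r13**2 - r23**2 + 2 * r12 * r13 * r23
    t = (r12 - r13) * math.sqrt((n - 3) * (1 + r23) / (2 * det))
    return t, n - 3

# Variable 1: socioeconomic status; 2: CTMM in Grade VII; 3: CTMM in Grade III.
# Assumed inputs: r12 = .323, r13 = .215, r23 = .471, n = 243.
t_value, df = hotelling_t(0.323, 0.215, 0.471, 243)
print(round(t_value, 2), df)
```

Under these assumptions t is about 1.72 on 240 degrees of freedom, which exceeds the one-tailed 5 per cent point of roughly 1.65 but not the two-tailed point of roughly 1.97. A directional prediction, as in hypothesis (d), would make the one-tailed reading the relevant one.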

(e) To investigate the fifth hypothesis, that a culture-reduced test would 
show less variation than a culture-loaded test in terms of item discrimination 
indices across socio-economic levels, the Progressive Matrices and CTMM 
Verbal Test were selected as representative of each category. The percentage 
passing each item in high and low socio-economic thirds was calculated for each 
test, distributions of discrimination indices drawn up for each test, and their 
variances compared. Only items with difficulty indices between 0.10 and 0.90 
were included in the analysis. 

The variances of the two distributions for the Matrices and CTMM Verbal 
Tests were calculated as 16.32 and 35.28, respectively, which, when compared, 
yielded a significant F ratio of 2.16 with 28 and 35 d.f. As hypothesized, items 
in the verbal test show greater variation in their discrimination between some 
socioeconomic levels than those of the non-verbal test. Although these dis- 
crepancies may not mar the predictive validity of the test assuming fixed 
treatments, they suggest that the verbal test is less suitable as a measure of 
intellectual potential. The Matrices discrimination indices were found to be 
similar throughout a wide range of difficulty. 
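
The item analysis can be sketched in two steps: a discrimination index per item (taken here as the percentage passing in the high socioeconomic third minus the percentage passing in the low third, which is an assumption about the exact index used), followed by a ratio of the two variances. The per-item percentages below are invented; only the two variances, 16.32 and 35.28, come from the text.

```python
def discrimination_indices(pct_high, pct_low):
    """Per-item difference in percentage passing, high third minus low third."""
    return [h - l for h, l in zip(pct_high, pct_low)]

def variance_ratio(var_a, var_b):
    """F ratio with the larger variance in the numerator."""
    return max(var_a, var_b) / min(var_a, var_b)

# Invented percentages for three items, for illustration only.
example_indices = discrimination_indices([80, 65, 50], [70, 60, 30])

# Variances reported in the text for the Matrices and CTMM Verbal items.
f_ratio = variance_ratio(16.32, 35.28)
print(example_indices, round(f_ratio, 2))
```

The ratio of the reported variances, 35.28 / 16.32, reproduces the printed F of 2.16.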

The criterion of relationship with school marks and standardized tests 
was then investigated. End-of-year examination averages over all school 
subjects for each of the thirteen classes in the study were correlated with each 
of the intelligence tests and subtests. Results from the California Achievement 
tests administered in the previous year were also correlated with the major 
variables, using the whole sample of 271 pupils. The median correlations for 
the thirteen school marks and the mean correlations for the three California 
Achievement subtests are included in Table 6. Although less efficient than 
the culturally weighted conventional tests in this ‘ cold-blooded’ predictive 
situation, the correlations of the better culture-reduced tests are substantial 
enough to indicate promise for adaptive treatments. 

Typical reliability coefficients reported from previous research are included 
in Table 6. A check was made on the Progressive Matrices in view of its promise 
in terms of the other criteria. A split-half reliability coefficient of 0.91 and a 
Kuder-Richardson coefficient of 0.83 indicated a degree of reliability very close 
to the average value reported in other studies at this age level (Burke, 1958). 
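
For readers unfamiliar with the two reliability estimates, the sketch below shows the Spearman-Brown step-up of a half-test correlation and Kuder-Richardson formula 20. The text does not say which Kuder-Richardson formula was used or whether the split-half figure was stepped up, so both choices are assumptions, and the pass/fail responses are invented for illustration.

```python
from statistics import pvariance

def spearman_brown(r_half):
    """Step a half-test correlation up to full-test length."""
    return 2 * r_half / (1 + r_half)

def kr20(responses):
    """Kuder-Richardson formula 20 for rows of 0/1 item scores
    (one row per pupil, one column per item)."""
    n_pupils = len(responses)
    k = len(responses[0])
    total_var = pvariance([sum(row) for row in responses])
    pq_sum = 0.0
    for j in range(k):
        p = sum(row[j] for row in responses) / n_pupils  # proportion passing
        pq_sum += p * (1 - p)
    return (k / (k - 1)) * (1 - pq_sum / total_var)

# Invented responses: five pupils, four items ordered from easy to hard.
responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]

kr = kr20(responses)
stepped_up = spearman_brown(0.5)
print(round(kr, 2), round(stepped_up, 2))
```

Population variances are used throughout so that the item term p(1 − p) and the total-score variance are on the same footing.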


* The sample was reduced from 271 to 243 to test this hypothesis, owing to incomplete 
test data from Grade III. 

All tests were considered convenient for classroom use, with the possible 
exception of the Holzinger-Crowder subtests, whose administration instructions 
were found to be tedious and repetitive, and time limits somewhat unrealistic. 
The Matrices test was judged the most popular by pupils and its ease of administration 
could certainly be regarded as an asset. 


VI.—DISCUSSION AND CONCLUSIONS. 


The main findings of the study have been drawn together in Table 6. In 
interpreting this table it should be remembered that, assuming adaptive rather 
than fixed treatment, the criteria of high g loading and low correlation with 
socioeconomic status should be accorded more weight than the present relation- 
ship with school achievement. 


Table 6 indicates that the Progressive Matrices best meets the criteria 
chosen for evaluation of a culture-reduced intelligence test. In addition to 
the high g loading and low correlation with socio-economic status, this test 
is a reasonably good predictor of school marks, has a respectable reliability, 
shows no change in relationship with socioeconomic status over four years, and 
gives little indication of variation in item discrimination across socioeconomic 
levels. Moreover, the test has face validity in that the progressive increase in 
difficulty makes it a tester of what it teaches and it appears to be little affected 
by the previous experience of the testee. 


The shorter Cattell Test of g and Lorge-Thorndike Figure Analogies 
Subtest rank next. Reorganization of instructions, lengthening, and item 
analyses might well improve these tests, particularly in relation to reliability 
and correlation with school marks, where they are weakest. The remaining 
culture-reduced tests showed less promise in this investigation although further 
study is currently being conducted on their potential values in relation to racial 
and language variables. 


TABLE 6 


SUMMARY OF SELECTED INTELLIGENCE TESTS EVALUATED AGAINST SIX CRITERIA. 

[Table entries not recoverable from the source. Surviving column head: Mean r, Acht. Tests. 
Surviving row labels: Lorge-Th. Fig. Class. ; Lorge-Th. Series ; Lorge-Th. Fig. Anal. ; 
Lorge-Th. Total ; Holz.-Crowd. Series ; Holz.-Crowd. Fig. Ch. ; CTMM Non-Lang. ; 
CTMM Lang. ; CTMM Total ; Laycock Total.] 

* Re-test reliability coefficient. 

For further research, too, one might predict considerable potential for an 
unspeeded test which used the Matrices approach of testing what it taught, and 
which supplemented the abilities measured with the most discriminating figure 
analogy and perceptual series items, or variations thereof. 


In summary, it seems reasonable to conclude that as tests of intellectual 
potential, the conventional verbal intelligence tests examined in this study show 
discrimination against children from lower socioeconomic backgrounds. It has 
been shown that, for 12 and 13-year-old pupils : 

(1) Culture-reduced tests sample the general intellectual ability factor as well as 
or better than conventional tests. 

(2) Most culture-reduced tests show negligible loadings on verbal and numerical 
factors. 

(3) Culture-reduced tests show significantly less relationship with socioeconomic 
status than do conventional tests. 

(4) A conventional verbal test (CTMM) showed significant increase in relationship 
with socioeconomic status over four years, whereas the Progressive 
Matrices showed no change. 

(5) Verbal items from the CTMM showed greater variation in item discrimination 
between social classes than did items from the Progressive Matrices. 


Generalizing further, the three most promising tests identified by the study 
could profitably be put to greater use in the estimation of intellectual ability 
for adaptive teaching purposes, and particularly in talent identification and 
classification of pupils from underprivileged and foreign language backgrounds, 
for distinguishing between mental defect and reading difficulties due to other 
causes, for holding intelligence constant in research experiments where verbal 
ability is irrelevant, and generally, as a supplement to verbal intelligence 
measures in the routine testing of children from all socioeconomic levels. 


THE RELATION OF PSYCHOLOGICAL TEST SCORES TO 
ACADEMIC BIAS 


By LIAM HUDSON 
(The Psychological Laboratory, University of Cambridge) 


Summary. On entering the Sixth Form, English schoolboys choose between 
Arts and Science subjects. Effectively, this choice commits them to one career 
field or the other. The present research is aimed to discover whether psycho- 
logical tests reflect the academic bias of such boys towards Arts or Science 
subjects ; and whether they give any indication of boys’ aptitude for specialised 
work at university and in research. A battery of conventional psychological 
tests was given to a sample of 315 15 and 16-year-old schoolboys, drawn from 
five schools. These boys had just finished the G.C.E. ‘ O ’ level examination, 
and stood a good chance of reaching university. 

Results show that the tests used do reflect academic bias surprisingly 
well. On the other hand, there are signs that the relation of test scores to 
aptitude for university and research work will prove to be complex. Some of 
the most gifted young Science specialists, for example, have scores which bear 
no relation to the trend of results in the sample as a whole. The implications 
of these and other findings are discussed. 


I.— INTRODUCTION. 


On entering the Sixth Form, at the age of 15 or 16, the clever English 
schoolboy has to choose between Arts and Science subjects. The choice he 
makes determines to a considerable extent the eventual course of his career. 
If he becomes, say, a History specialist, he will find it difficult later on to change 
to Classics or French ; and even more difficult, if not impossible, to change to 
Mathematics or Biology. Obviously, the more relevant information he has 
before him when making this decision, the better. 


It is this choice which forms the context for the present research. Its 
ultimate aim is a series of tests which would show a 16-year-old (a) roughly 
what his own potentialities are, and (b) what demands would be made on him by 
various careers at university and in research. In order to develop and validate 
such tests, a series of longitudinal experiments are needed, in which individuals’ 
test scores as schoolboys are related to their achievements as adults. The 
present paper is merely an interim report on the first stage of one such project. 
It concerns the basic question: do test scores reflect a schoolboy’s bias of 
ability towards Arts or Science at the age when the crucial choice is being made ? 
Although this relation of test scores to academic bias will be discussed in detail, 
no attempt will be made to show that the relation is stable through time, or 
more generally that the tests have predictive validity. The criteria used are 
those appropriate to the Lower Sixth Form, and these are not grounds for 
vocational advice. 


Relatively little is known about the relation of test scores to academic 
bias. During the last twenty years, many attempts have been made to validate 
differential aptitude batteries, but these have met with little success (Super, 
1957; Horst, 1959; Anastasi, 1961). A notable exception is Crawford and 
Burnham’s (1946) report that the Yale Differential Aptitude yao has 
differential validity in predicting college grades. They also report (p. 242) that 
certain undergraduates have marked and persistent verbal/spatial biases of 
test score. Similar biases have been reported by Bradford (1948) among 
British 11-plus candidates. The results in both cases imply that such biases of 
test score are related to biases of academic ability and interest; but these 
relations have not been substantiated (Burt, 1950). 


II.—METHOD. 
(1) The Sample. 


The present results are based on a sample of 315 schoolboys aged 15 and 16.
These were drawn from five schools: two public boarding, one public day,
and two grammar. All five have high academic reputations. In each case, the
boys were tested as they entered the Sixth Form, having just taken the G.C.E.
‘O’ level examination. They were selected as having a reasonable chance of
reaching university. The general level of their ability was high, the mean 
intelligence test score being only a few points lower than that found among 
Cambridge undergraduates (Hudson, 1960). 


(2) Subject-Groups and Criterion-Groups. 

In analysing the results, two systems of classification are used: Subject- 
Groups and Criterion-Groups. In the first of these, boys are grouped in terms 
of their special subject : Classics, Modern Languages, History, Physical Science, 
or Biology. This method is straightforward and yields a number of interesting 
comparisons. On the other hand, it gives little indication either of a boy’s 
aptitude for his special subject, or of his academic bias. The most useful 
analysis is one which relates test scores to criteria both of academic success and 
of academic bias. In the present case, this is done by using Criterion-Groups. 


Both the academic bias and the academic success of each boy were
assessed from questionnaires, staff reports and G.C.E. ‘O’ level results. Four
Criterion-Groups were then selected, with twenty-five boys in each: ‘Arts
Specialists,’ consisting of boys who are highly successful in Arts subjects and
poor at Science; ‘Arts All-rounders,’ who are highly successful at Arts and
adequate at Science; ‘Science All-rounders,’ who are highly successful at
Science and adequate at Arts; and ‘Science Specialists,’ who are highly
successful at Science and poor at Arts. As they were restricted to boys who are
highly successful in their own subject, these Criterion-Groups reflect both 
differences in academic bias and differences in academic level. Differences in 
academic bias are reflected in comparisons between the Criterion-Groups ; 
differences in academic success are reflected in comparisons between the 
Criterion-Groups and the remaining 215 boys in the sample. Four Criterion- 
Groups rather than five were used because genuine all-rounders proved 
relatively hard to find. The number of boys in each of the Subject- and 
Criterion-Groups is shown in Table 1. (It is important to note that the two 
systems of classification are independent, in the sense that a Classicist may be 
found in either of the Arts Criterion-Groups or in Rest-of-Arts, but not, of 
course, among the Scientists.) 


(3) The Indices Used. 

General Intelligence. The test used was A.H.5, a group test of high-grade 
intelligence specially suited to student samples (Heim, 1956). The test is 
administered in two Parts: Part I being verbal and numerical, Part II being 
diagrammatic.

Accuracy. A number of accuracy scores were derived from A.H.5, the
most useful being ‘Worse Accuracy’ and ‘Range Accuracy.’ A boy’s accuracy
on Parts I and II of A.H.5 was calculated separately, as the number of correct
answers as a percentage of the number attempted. The lower of the two 
percentages was then defined as his Worse Accuracy, and the difference between 
the two as his Range Accuracy. The accuracy score given in Tables 2-5 is 
Worse Accuracy. 
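
The two accuracy scores defined above can be illustrated with a short sketch. The function names and the example figures are hypothetical and are not taken from the study; only the definitions (percentage correct of those attempted, the lower of the two parts, and the difference between the parts) follow the text.

```python
def accuracy_pct(correct: int, attempted: int) -> float:
    """Correct answers as a percentage of the answers attempted."""
    if attempted <= 0:
        raise ValueError("attempted must be positive")
    return 100.0 * correct / attempted


def worse_and_range_accuracy(part1, part2):
    """Each part is a (correct, attempted) pair for one Part of A.H.5.

    Returns (Worse Accuracy, Range Accuracy): the lower of the two
    percentages, and the absolute difference between them.
    """
    a1 = accuracy_pct(*part1)
    a2 = accuracy_pct(*part2)
    return min(a1, a2), abs(a1 - a2)


# Hypothetical boy: 18 right of 20 attempted on Part I,
# 12 right of 16 attempted on Part II.
worse, rng = worse_and_range_accuracy((18, 20), (12, 16))
print(worse, rng)  # 75.0 15.0
```

On this reading a careful but slow boy can obtain a high accuracy score, since unattempted items do not count against him.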


Spatial Ability. An experimental spatial test was used as none of those 
already published was sufficiently difficult (Heim and Watts, 1957). It was 
aimed to tax students’ ability at manipulating geometrical shapes in their 
mind’s eye, but in practice was found to correlate substantially with General 
Intelligence (r = approximately 0.5).

Vocabulary. An experimental, multiple-choice form was used, published 
tests being too easy (Heim and Watts, 1961). 

General Knowledge. An experimental open-ended test was used, involving 
factual knowledge about 100 men, famous in current affairs, literature, painting, 
music, science. 

Cognitive Bias. A differential score expressed the bias of each boy's 
ability in the cognitive tests. This contrasted his Verbal ability on the one 
hand with his Numerical and Spatial abilities on the other. (These scores were 
derived from A.H.5, A.H.5 and the Spatial Test, respectively.) 

Bias of Interests. Interests were assessed by a system of cross-checks.
Five sources of information were used, although all five were not available in
every case: questionnaire, form-ratings, general knowledge, school staffs,
interviews. A differential score was used to measure each boy’s bias towards
‘cultural and political’ as opposed to ‘practical and outdoor’ interests. A
working definition of this second category of interest was reached with some
difficulty. Certain activities contribute heavily to an Arts/Science
discrimination: car and radio mechanics, model building, carpentry, photography,
camping, rock climbing, sailing, bird-watching. Others, ostensibly
similar, show a significant tendency to obscure it: cine-photography, model
railways, hunting, shooting, fishing, cycling, skiing, foreign travel. Similar
difficulties were found in the ‘cultural’ sphere: interests in literature and
painting contribute heavily to an Arts/Science discrimination, whereas an
interest in music tends to obscure it. As the present aim is primarily
discriminative, an operational definition of these categories was felt to be in order.
So those interests already mentioned as obscuring the Arts/Science
discrimination were left out of account.

With one exception, the seven indices mentioned above are reasonably
independent of one another, the correlations between them ranging from zero to
0.5. The exception concerns the scale of Cognitive Bias and the Spatial Test from
which it is partially derived, r in this case being of the order of 0.7. Scores on the
first five of these indices are graded on a five-point scale, A-E. Grade A
represents the top 10 per cent. of each school sample; B, the next 20 per cent.;
C, the middle 40 per cent.; D, the next 20 per cent.; and E, the bottom 10 per
cent. A fuller account of tests and methods is given in Hudson (1961).
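
The five-point grading can be sketched as follows. This is an illustration only: the function name is hypothetical, and the handling of tied scores and of samples whose size is not a multiple of ten is an assumption, since the text does not say how such cases were treated.

```python
def grade_scores(scores):
    """Grade one school's raw scores A-E by rank within that school.

    Bands follow the text: top 10 per cent. A, next 20 B, middle 40 C,
    next 20 D, bottom 10 E. Ties are broken by original order, which is
    an assumption made for this sketch.
    """
    n = len(scores)
    # Cumulative upper bound of each band, as a fraction of the sample.
    bands = [(0.10, "A"), (0.30, "B"), (0.70, "C"), (0.90, "D"), (1.00, "E")]
    order = sorted(range(n), key=lambda i: scores[i], reverse=True)
    grades = [None] * n
    for rank, i in enumerate(order):
        position = (rank + 1) / n  # 1/n for the top scorer, 1.0 for the lowest
        for bound, grade in bands:
            if position <= bound + 1e-9:  # tolerance for float rounding
                grades[i] = grade
                break
    return grades


# Hypothetical school sample of ten boys with raw scores 1..10.
print(grade_scores(list(range(1, 11))))
# ['E', 'D', 'D', 'C', 'C', 'C', 'C', 'B', 'B', 'A']
```

Because the grading is done within each school sample, a given grade marks a boy's standing among his own schoolfellows rather than a fixed raw score.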


III.—RESULTS.
(1) Subject-Groups. 
The first aim in presenting the results is to give test ‘ profiles’ for each 
Subject-Group; to show, for example, whether there are characteristic
differences between Classicists and Historians. In the interests of clarity, Table 2
summarises these profiles qualitatively. The over-simplification which this
involves is relatively unimportant, because the difference between any given 
pair of Subject-Groups is expressed quantitatively in Table 3. Table 3 indicates 
which of the differences shown in Table 2 are of real discriminative importance. 


Table 2 shows that Classicists are typically of average ability or better, 
and that they are particularly strong in accuracy and vocabulary. Their 
interests are diverse and they have no marked cognitive bias. Modern Linguists 
are unremarkable, apart from a verbal cognitive bias and, very surprisingly, a 
weakness in vocabulary—an inexplicably large number of highly successful 
Linguists had vocabulary grades of D or E. Historians are clearly distinguished 
by low cognitive scores and a strong ‘ cultural and political’ bias of interests. 
Physical Scientists have high scores on cognitive tests, and a marked bias both 
of cognitive ability and of interest. Biologists are undistinguished except for a 
strong ‘ practical and outdoor ’ bias of interests. 


TABLE 1


THE SAMPLE TESTED, DIVIDED IN TERMS OF (i) SUBJECT-GROUPS, AND (ii) CRITERION-GROUPS.


[Table body not recovered. Surviving headings: SUBJECT-GROUPS; CRITERION-GROUPS; All-Rounders; Rest-of-Science.]








TABLE 2


SUBJECT-GROUPS: A QUALITATIVE SUMMARY OF TEST SCORES.


                    GENERAL                 SPATIAL               GENERAL    COGNITIVE
                    INTELLIGENCE  ACCURACY  ABILITY   VOCABULARY  KNOWLEDGE  BIAS                 INTERESTS             n

CLASSICS            Fair          V. Good   Good      V. Good     Fair       None                 Diverse               27
MODERN LANGUAGES    Fair          Fair      Fair      Weak        Fair       Verbal               Diverse               38
HISTORY             Poor          Weak      Poor      Fair        Fair       Verbal               Cultural & Political  68
PHYSICAL SCIENCE    V. Good       Good      V. Good   Fair        Fair       Numerical & Spatial  Practical & Outdoor   142
BIOLOGY             Fair          Fair      Fair      Weak        Fair       None                 Practical & Outdoor   39


TABLE 3


INDICES WHICH DISCRIMINATE BETWEEN ANY GIVEN PAIR OF SUBJECT-GROUPS.


                    MODERN LANGUAGES   PHYSICAL SCIENCE          BIOLOGY

CLASSICS            Vocabulary         Interests                 Interests; Vocabulary

MODERN LANGUAGES                       Cognitive Bias            -

HISTORY                                INTERESTS;                INTERESTS
                                       Cognitive Bias;
                                       Spatial; Intelligence

PHYSICAL SCIENCE


STRONG DISCRIMINATION = 0%-15% errors.
MODERATE DISCRIMINATION = 16%-25% errors.
Weak Discrimination = 26%-35% errors.


In describing the discriminations between the various Subject-Groups,
three standards of discriminative efficiency are established: Strong, Moderate
and Weak. A discrimination between two groups is defined as Strong if the
proportion of errors involved is 15 per cent. or less, as Moderate if the proportion
of errors lies between 16 per cent. and 25 per cent., and as Weak if it lies between
26 per cent. and 35 per cent. For example: if 15 per cent. of Historians lie 
above a given cut-off score on the vocabulary test while only 25 per cent. of 
Classicists lie below it, the proportion of errors involved in discriminating 
between these two groups is 20 per cent., and the discrimination is defined as 
Moderate. These three categories are distinguished typographically in both 
Tables 3 and 5; and in each cell of these Tables, the indices are arranged in 
descending order of discriminative efficiency. The cut-off score used in each 
case is the median for the whole schoolboy sample. All the discriminations 
shown in Tables 3 and 5 are statistically significant (P = 0.05; χ² Test with
Yates's Correction).
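
The worked example above, and the significance test named with it, can be sketched in a few lines. The function names are hypothetical. Taking the error proportion as the simple mean of the two misplaced percentages is an assumption consistent with the example (15 and 25 giving 20), and treating the limits 15, 25 and 35 as upper bounds for fractional values is likewise an assumption. The chi-square function uses the standard formula for a 2 x 2 table with Yates's continuity correction.

```python
def error_proportion(pct_misplaced_a: float, pct_misplaced_b: float) -> float:
    """Mean percentage of boys falling on the wrong side of the cut-off."""
    return (pct_misplaced_a + pct_misplaced_b) / 2.0


def discrimination_label(error_pct: float):
    """Strong, Moderate or Weak by the limits in the text; otherwise None."""
    if error_pct <= 15:
        return "Strong"
    if error_pct <= 25:
        return "Moderate"
    if error_pct <= 35:
        return "Weak"
    return None


def yates_chi_square(a: int, b: int, c: int, d: int) -> float:
    """Chi-square for the 2 x 2 table [[a, b], [c, d]] with Yates's correction."""
    n = a + b + c + d
    corrected = max(0.0, abs(a * d - b * c) - n / 2.0)
    return n * corrected ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# The example from the text: 15 per cent. of Historians above the cut-off,
# 25 per cent. of Classicists below it.
errors = error_proportion(15, 25)
print(errors, discrimination_label(errors))  # 20.0 Moderate

# Hypothetical 2 x 2 counts (group by above/below the median).
print(yates_chi_square(20, 5, 5, 20))  # 15.68
```

With one degree of freedom, a corrected chi-square above 3.84 corresponds to significance at the 0.05 level.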


Although none of the discriminations between Subject-Groups is Strong, 
Table 3 shows that there are a number of significant differences between boys 
who specialise in the various academic subjects. The most obvious is that
separating Historians from the Physical Scientists. For the rest, Classicists are
distinguished from Modern Linguists and Biologists by their high vocabulary
and from both groups of Scientists by the bias of their interests. Modern
Linguists are distinguished from Physical Scientists by their cognitive bias.


(2) Criterion-Groups.

The Criterion-Groups consist of boys who are highly academically success- 
ful; and they differ only in their academic bias. So contrasts between 
Criterion-Groups should reveal whether academic bias is reflected in test scores ; 
and the contrast of the Criterion-Groups with the rest of the sample should 
reveal whether academic success is so reflected. The profiles of the four 
Criterion-Groups and the remainder of the sample are summarised qualitatively 
in Table 4.

In general, the Arts Specialists have poor cognitive scores. They are also
distinguished from Scientists by their interests, and from all other Criterion- 
Groups by their cognitive bias. The other three Criterion-Groups tend to have 
high scores, and the differences between them are relatively slight. These lie 
in interests, cognitive bias and, perhaps, spatial ability. The two ‘Rest’
groups score rather similarly. It seems that they occupy middle ground between 
the Arts Specialists at one extreme and the remaining three Criterion-Groups at 
the other. 

The efficiency of each index in distinguishing between any given pair of 
Criterion-Groups is shown in Table 5. Again, only statistically significant 
differences are mentioned. The results may be summarised as follows : 


(a) On the whole, discriminations between Criterion-Groups are much
stronger than those between Subject-Groups. 


(b) Among successful schoolboys, one distinction is of over-riding
importance: that which separates Arts Specialists (i.e., those who are poor at Science)
from the rest. 


(c) Relative to this gulf, the dichotomy between Arts and Science is less 
important. Only his interests reveal the direction in which a successful all- 
rounder has specialised. 


(d) Academic bias is reflected excellently among Arts boys, but poorly 
among Scientists. Or—to express the same relation in reverse—the tests 
reflect weakness in Science excellently, but only poorly reflect weakness in 
Arts.


(e) The relative similarity of Arts Specialists and Rest-of-Arts groups, 
and the dissimilarity of both from Arts-All-rounders emphasizes that tests 
do not measure the qualities underlying success in Arts, and also that the Arts 
All-rounder is a relatively unusual type. 


(f) There is a surprising, although non-significant, tendency for Arts 
Specialists' cognitive scores to be worse than those of the Rest-of-Arts group. 


(g) The indices: General Intelligence, Accuracy, and Spatial Ability all 
distinguish boys who are good at Science from those who are poor. Neverthe- 
less, very high Accuracy scores are largely restricted to Arts All-rounders, 
while the Accuracy scores of successful Scientists tend to cluster in the B and C 
grades. Vocabulary measures academic aptitude in Arts and Science alike ; 
and, to a lesser extent, the same seems to be true of General Knowledge. 
Cognitive Bias gives a good indication of academic bias, particularly among 
Arts boys; and Bias of Interests distinguishes Arts boys as a whole from 
Scientists. 


Tables 4 and 5 do not give a clear idea of how efficiently the various academic 
groups could be distinguished in practice, if all seven indices were combined. 
As a rough check on the strength of these discriminations, the scores of all the
Criterion-Group members were placed on cards and an attempt was made to
identify each boy in the light of the evidence given in Tables 4 and 5. The
results, shown in Table 6, are most satisfactory:


(a) Two-thirds of the Criterion-Group members are identified correctly, 
whereas only some 25 per cent. would have been identified correctly had they
been sorted at random. As many as 97 out of 100 are placed either correctly or 
in a neighbouring group. 


126 Test Scores and Academic Bias 


TABLE 4


CRITERION-GROUPS: QUALITATIVE SUMMARY OF TEST SCORES.


                      GENERAL                 SPATIAL               GENERAL    COGNITIVE            BIAS OF
                      INTELLIGENCE  ACCURACY  ABILITY   VOCABULARY  KNOWLEDGE  BIAS                 INTERESTS             n

ARTS SPECIALISTS      Poor          Poor      Poor      Good        Fair       Verbal               Cultural & Political  25
ARTS ALL-ROUNDERS     V. Good       V. Good   Fair      V. Good     Good       None                 Cultural & Political  25
SCIENCE ALL-ROUNDERS  V. Good       Good      Good      V. Good     V. Good    None                 No Bias               25
SCIENCE SPECIALISTS   V. Good       Good      V. Good   Good        Good       Numerical & Spatial  Practical & Outdoor   25
REST-OF-ARTS          Weak          Weak      Weak      Fair        Weak       None                 No Bias               82
REST-OF-SCIENCE       Fair          Fair      Fair      Weak        Fair       None                 Practical & Outdoor   133





TABLE 5


INDICES WHICH DISCRIMINATE BETWEEN ANY GIVEN PAIR OF CRITERION-GROUPS.


Column headings: ARTS ALL-ROUNDERS; SCIENCE ALL-ROUNDERS; SCIENCE SPECIALISTS; REST-OF-ARTS; REST-OF-SCIENCE.
Row headings (as recovered): ARTS ALL-ROUNDERS; SCIENCE ALL-ROUNDERS; SCIENCE SPECIALISTS; REST-OF-ARTS.

[Cell positions not recovered. The index lists survive in the following order, one cell per line:]

SPATIAL; ACCURACY; INTELLIGENCE; Cognitive Bias; Interests
SPATIAL; COGNITIVE BIAS; INTERESTS; INTELLIGENCE; Accuracy
INTERESTS; COGNITIVE BIAS; Spatial; Accuracy; Vocabulary
ACCURACY
COGNITIVE BIAS; INTELLIGENCE; Spatial
INTERESTS
INTERESTS
Cognitive Bias
INTELLIGENCE; Accuracy; Vocabulary
Cognitive Bias
INTELLIGENCE; GENERAL KNOWLEDGE; Vocabulary; Accuracy; Spatial
SPATIAL; COGNITIVE BIAS; INTELLIGENCE; Interests; Accuracy
INTERESTS; VOCABULARY; Intelligence
VOCABULARY; General Knowledge; Intelligence
Intelligence; Spatial
Cognitive Bias; Interests


STRONG DISCRIMINATION = 0%-15% errors.
MODERATE DISCRIMINATION = 16%-25% errors.
Weak Discrimination = 26%-35% errors.


TABLE 6


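AN ATTEMPT TO IDENTIFY CRITERION-GROUP MEMBERS, USING ALL SEVEN INDICES TOGETHER.


Column headings: Identified as “Arts Spec.”; “Arts All-R”; “Sc. All-R”; “Sc. Spec.”; n.

ARTS SPECIALISTS          18
ARTS ALL-ROUNDERS
SCIENCE ALL-ROUNDERS
SCIENCE SPECIALISTS

[Only the first entry of the Arts Specialists row survives; the remaining cell values were not recovered.]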





(b) There is some confusion between neighbouring groups, particularly
between the two groups of Scientists; but the discrimination between the other
groups is good, and that between the two extremes, the Arts and Science
Specialists, is almost complete. Not one Arts Specialist was identified as a
Science Specialist, or vice versa. 

(c) The discrimination between Arts and Sciences is also surprisingly 
strong ; only four Arts boys were identified as Scientists, and only six Scientists 
were identified as Arts boys. 
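
The chance baselines against which these figures are set can be worked out directly. The sketch below is an illustration under stated assumptions: boys are imagined as sorted uniformly at random into four equal groups, and the groups are taken to lie in the order Arts Specialists, Arts All-rounders, Science All-rounders, Science Specialists, so that ‘neighbouring’ means adjacent in that order. The function name is hypothetical.

```python
from fractions import Fraction


def chance_rates(n_groups: int = 4):
    """Expected hit rates under uniform random sorting into ordered groups.

    Returns (P(exactly right), P(right or in a neighbouring group)),
    assuming equal group sizes.
    """
    exact = Fraction(1, n_groups)
    near_total = Fraction(0)
    for true in range(n_groups):
        # Guesses counted as acceptable: the true group or an adjacent one.
        acceptable = sum(1 for guess in range(n_groups) if abs(guess - true) <= 1)
        near_total += Fraction(acceptable, n_groups)
    return exact, near_total / n_groups


exact, near = chance_rates()
print(float(exact), float(near))  # 0.25 0.625
```

On these assumptions random sorting would place about 62 boys in 100 either correctly or in a neighbouring group, against the 97 reported, and 25 in 100 exactly, against some two-thirds.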

The samples used here are rather small, and a certain degree of chance error 
is inevitable. When the research is extended to other schools, particularly 
those with different academic traditions, changes must be expected. (It has 
been found, for instance, that the intellectual calibre of Biology specialists varies 
considerably from one school to another.) It seems unlikely, on the other hand, 
that the major differentiations will be materially altered. 


(3) Other Findings. 

(i) Gifted Scientists. The general level of test scores among successful
Physical Science specialists is high ; and, in practice, most gifted young scient- 
ists are easily identifiable from their scores. There are, nevertheless, a few 
exceptionally gifted Physical Science specialists whose scores are unaccountably 
low. Perhaps three cases are sufficient to illustrate this type: D., a mathematical 
' prodigy ' ; P., a Chemistry specialist who has since gained an Open Scholarship 
at Cambridge ; and R., the best of a remarkable group of young scientists at the 
public day school. Their test scores are respectively : General Intelligence—C, 
D, C; Accuracy—E, C, D ; Spatial Ability—A, D, C; Vocabulary—B, B, C; 
General Knowledge—C, B, A. All three have strong or fairly strong interests, 
but none has either a ' political and cultural ’ or ' practical and outdoor ' bias. 
All three are considered by their form-mates to be hard-working. In general, 
their remarkable talents reflect themselves in their test scores in hardly a 
single respect. 

Although such cases account for a mere 2-3 per cent. of the whole sample, 
they demand an explanation. The most obvious is that the boys concerned were 
not trying. With most of them, this can be ruled out of account. Most of them 
are notably conscientious; and as their academic reputation usually preceded
them, they were watched with special care during the testing sessions. A second
possibility is that these boys were made over-anxious by the novelty of the
testing situation. Again, this seems feasible in a few cases but not in all. There are
marked differences of personality and emotional maturity within this group, 
so it is unlikely that a single emotional factor could account for all of them. 
A third and much more likely explanation is that there are important scientific 
abilities which the present tests miss. There is, after all, no prima facie reason 
for believing that the complexity of scientific work can be epitomised in the 
present handful of paper-and-pencil puzzles. Indeed, it would be strange if it 
could. There are several normal scientific skills which conventional tests take 
little account of ; the ‘ inductive’ skills of marshalling complex evidence, for 
example, and accurate generalisation from incomplete evidence. Also there is 
the whole area of creativity (e.g., Getzels and Jackson, 1962). If new testing 
methods are needed, the best way of discovering them is presumably a detailed 
study of present anomalies. 

(ii) Extreme Arts Specialists. Another tendency obscured in the general 
presentation of results concerns a small group of Arts Specialists. These are 
boys who are outstanding at Arts subjects but very weak at subjects which tax 
their powers of formal reasoning, e.g., Mathematics, Physical Science, Latin. 
This group shows a strong negative relation between such measures as General
Intelligence, Accuracy, Spatial Ability on the one hand, and General Knowledge 
on the other. The scores of three of these extreme Arts Specialists—C., G., S.— 
are: General Intelligence—E, E, E; Accuracy—E, E, E ; Spatial Ability—E, 
E, E; Vocabulary—B, C, B ; General Knowledge—A, A, A. All three have a 
very strong ' political and cultural ' bias of interests, and a heavily verbal bias 
of cognitive ability. Both C. and G. are exceptionally hard-working. The 
General Knowledge scores of these three boys are, respectively, the fourth 
highest, third highest and the highest in the whole sample; while, with one 
exception, G. has the lowest General Intelligence in the whole sample, and C. 
the lowest but one. As with the Physical Scientists mentioned above, this type 
is exceptional ; such boys account for less than 5 per cent. of the whole sample. 
Yet, they are possibly of even greater practical importance, because it is 
precisely this type of boy who is filtered out of the academic system by 
university requirements in Latin and Mathematics. 

(iii) Negative and Compensatory Relations. This negative relation among
extreme Arts Specialists seems to be one instance of a more general pattern.
Among Historians and Modern Linguists there is a marked tendency for a high
score on any one test to be counter-balanced by low scores on others. In cases
where a boy’s scores were all low, he was found almost invariably to be excep- 
tionally hard-working. So it seems that in History and Modern Languages 
certain mental abilities are mutually inter-changeable ; and that where ability 
is lacking, hard work will serve in its stead. It will be interesting to see which of 
the various test profiles found among Arts boys bode well academically ; 
and whether qualitative differences of intellectual type are detectable between 
them. Why this form of equivalence should not hold among Classicists, Physical 
Scientists or Biologists is not clear. One possibility is that History and Modern 
Languages admit a wide range of differences in intellectual ‘ style,’ whereas in 
Classics and Science there may be a greater tendency for one intellectual method 
to be the best. 

These results suggest that a compensatory mechanism may be at work in 
some of the boys concerned ; that special weakness in, say, formal reasoning 
has caused them to concentrate on general knowledge. Compensation of this 
kind is illustrated in a case of sibling rivalry. These two brothers were 18 months 
apart in age, yet the younger had overtaken the older academically. They had
an unsatisfactory home background; and at the time of his interview, the
elder brother showed obvious signs of emotional disturbance. The younger
brother was a Physical Scientist with cognitive scores above the average, a
spatial bias of abilities, poor general knowledge and exclusively practical
interests. The elder was a Historian with very low cognitive scores, a verbal 
bias of ability, good general knowledge, and exclusively cultural interests. In 
effect, the elder boy had concentrated his efforts in precisely those areas where 
his brother was weak. 


(iv) Accuracy, Emotional Stability and Originality. Previous work has 
shown that accuracy scores can be used as an index of emotional stability 
(Petrie, 1948). In the present research, both Worse and Range Accuracy were 
found to correlate substantially with interview ratings of emotional stability 
(Hudson, 1961): r = 0.42 and 0.65, respectively; n = 51. Individual cases also
suggested that there may be a connection between accuracy scores and potential 
originality. The Worse Accuracy scores of the six exceptionally gifted boys 
mentioned above were poor. The same is true of their Range Accuracy, their 
grades being E, C, D and E, E, D for Arts Specialists and Physical Scientists, 
respectively. Among Physical Scientists such poor accuracy scores are relatively 
rare, and generally are a sign of academic mediocrity. Although superficially 
anomalous, this finding is reminiscent of some previous suggestions about the 
nature of originality. Guilford (1950), for example, has suggested that
creativity may stem from a general distaste for repetition; and Getzels and Jackson
(1962) found ‘ creative ' students impatient with the routine of classroom work. 


IV.—CONCLUSION. 


With certain important reservations, we may conclude that the academic 
bias of clever schoolboys does reflect itself in test scores with surprising clarity. 
If this relation holds true at university and in research, the present tests could 
be of some use in advising schoolboys about their choice of career. On the other 
hand, there are good reasons for expecting this relation to weaken. The demands 
made by a particular subject, Physics for example, may well change radically 
between G.C.E. ' O ' level and Tripos ; work for Tripos may differ from research ; 
and the various branches of research in an area like Physics may differ among 
themselves (Roe, 1953). So although conventional tests may be said to have 
proved their worth at the Sixth Form level, the influence of ill-defined qualities 
like originality and maturity of judgment may well increase as the present 
sample matures; and if they increase, conventional tests may tend to lose their
value. 

Even at the Sixth Form level, much remains to be done. The present tests
do not reflect ability in Arts subjects, nor do they distinguish certain exception- 
ally gifted young scientists. New tests are certainly needed, but there is little 
indication of what shape they should take. The abilities which the present 
tests fail to detect may be primarily cognitive, e.g., the ability to marshall 
complex evidence. They may be more personal, e.g., imaginativeness, maturity 
of judgment, insight into human motives. Or they may involve differences of 
intellectual attitude and ' style' (Rapaport 1957 ; Getzels and Jackson 1962). 


The fact remains that there are brilliant boys with relatively poor test 
scores, and seemingly dull boys whose scores are relatively high. As Getzels 
and Jackson point out, conventional psychological tests normally account for 
less than a quarter of the variance in academic success ; and it seems that the 
deductive, puzzle-solving abilities which these tests do measure have little 
bearing on intellectual originality. So although there is scope for new cognitive
tests of the orthodox kind, the most fruitful research may well lie in those areas
of attitude and ‘style’ where cognitive and personal qualities interact. In
the past there has been a tendency to approach these areas obliquely, using
projective techniques like the Rorschach (Rapaport, 1957); but distinctions
of type emerging from this work have not yet been related to differences of
attitude and ‘style’ discernible in academic practice. Obviously, if these
distinctions are to stand any chance of explaining intrinsically interesting 
phenomena, this gulf between the laboratory and the world at large will have to 
be bridged. It is at this point that the skills of the natural historian become 
vital. If test scores are to be related to the real world—and this, after all, 
is what validation should consist in—mental testers are obliged to observe 
intellectual workers at work, to analyse precisely what they do and how 
they do it. The immediate task, in other words, is to find ways of describing 
and quantifying those intellectual qualities which do not lend themselves to 
quantification automatically. A good example of the exploratory work which 
this will involve is Getzels and Jackson's (1962) study contrasting ‘highly
intelligent’ and ‘highly creative’ school children. This study not only shows
sensitivity to qualitative differences in school children's thinking, but also 
tackles the formidable problems of description which these qualitative differ- 
ences present. Perhaps it is a good omen for the future of mental testing that 
the statistics these authors use are largely rudimentary. 


Finally, the limitations of the present results must be reasserted. In the 
first place, the intellectual demands made by the various Sixth Form subjects 
are relatively elementary ; in the second place, the choice between them by no 
means determines the course of a schoolboy’s career. A Sixth Form History 
specialist may end up as an accountant, businessman, politician, anthropologist, 
soldier, barrister, clergyman. He may have talents unrelated to his job—he 
may work in a bank and write poetry, or in a customs house and paint. Or he 
may have talents which circumstances force him to neglect. The present 
categories are bound to be provisional, and no really satisfactory assessment 
can be made until the present sample has been followed through university and 
beyond. Their test scores as schoolboys can then be related not merely to 
formal criteria like degree class, but to full descriptions of their intellectual lives
as adults.


CHANGES IN TERMAN-MERRILL I.Qs. OF EDUCATIONALLY
SUB-NORMAL BOYS


BY 
C. S. RUSHTON 
(Sometime Educational Psychologist, Kent School Psychological Service) 
AND 
A. E. STOCKWIN 
(Sometime Deputy Headmaster, Seabrook Lodge School, Hythe) 


Summary. Test results on the Terman-Merrill* 1937 revision of the Stanford-
Binet Intelligence Scale (Form L) were examined for 111 ‘educationally
subnormal’ boys in a residential special school. A comparison between results
on entry at the age of 7 years and above, and before leaving between 14 and 16
years, showed a deterioration in mean I.Q. scores, despite the fact that the
Roberts-Mellone† (1952) corrections were applied.

The proportion of ‘verbal’ subtests passed was consistently lower than
the proportion of ‘non-verbal’ tests, at all age levels.

The hypothesis that the deterioration in mean I.Q. scores between mean
C.A. 9½ and 15½ years was due partially to the increased verbal weighting on
this test at the higher age levels was examined and supported.

It was suggested that the test under consideration is unsuitable for the
all-round cognitive assessment of educationally sub-normal children.
Suggestions were made for a more suitable type of test, and for further research on
Terman-Merrill (L).


I.—INTRODUCTION AND PREVIOUS RESEARCH. 


PREVIOUS investigations on this subject have yielded conflicting results. Many 
of the earlier findings were consistent in that the mean I.Q. of dull children 
appeared to decline with age. Thus, Phillips (1940) concluded that for children 
with Terman-Merrill I.Qs. between 50 and 80, there was a tendency for a general
and substantial decline which increased progressively with age. Parker (1934),
Thorndike (1940) and Stott (1960) all show increasing deterioration of I.Q.
scores with increasing interval between test and retest. 


Perhaps the first and most striking contradiction of these findings came 
from Wellman (1934), who reported that all the groups which she tested tended 
to show improved I.Qs. while ‘the most backward group’ showed the greatest
increase. Schmidt (1946) and the Clarkes (1958 and 1959) also reported striking 
gains in the I.Qs. of the ‘feeble-minded,’ but the former study (Schmidt's) 
was severely criticized by Kirk (1948) and cannot now be taken seriously. 


More recent studies have had available more accurate measuring techniques. 
Collman and Newlyn (1958), for instance, in a test-retest study of educationally
subnormal pupils in special schools in Essex, found that the use of the R.M.
correction table (1952) was fully justified. On re-test they found no significant
mean change in I.Q., but this was after a period of only one year. Hiskey and
Sadnavitch (1958) and Scarr (1953) report similar findings. 


* From this point onwards referred to as T-M (L). 
† From this point onwards referred to as R.M. corrections.
‡ From this point onwards referred to as E.S.N. children.

In a private communication, P. F. Portwood states that when I.Q. scores
on T-M (L) were considered without the R.M. corrections being applied, there 
appeared to be a significant drop in I.Q. scores after a two-year interval. When 
the corrections were applied, however, the significant drop (due to the increasing 
S.Ds. between these age levels and thus to a spurious statistical effect) was 
changed to a slight though statistically insignificant increase. Here again is 
evidence which appears to confirm empirically the validity of the R.M. correc- 
tions, and the advisability of using them when dealing with T-M (L) I.Qs. on 
E.S.N. pupils. 


II.—PRESENT INVESTIGATION—METHOD.


A comparison was made between the T-M (L) I.Qs. of 111 E.S.N. boys in a
residential special school in Kent on ‘First Testing’ (usually at the age of 8 or
9 years), and ‘Last Testing’ (between the ages of 14 and 16 years). All testing
was done by assistant school medical officers or by the deputy headmaster, who
had been trained in the administration of T-M (L).

For testings below C.A. 15 years, R.M. corrections were applied to all I.Qs.
Above the age of 14 years 11 months no R.M. corrections are published, yet for
the 15-year-old age group the mean I.Q. is 102.8 and S.D. 19 (Terman and
Merrill 1937), and thus needs correction. McNemar, in his earlier correction
table (1942), used the same corrections for C.As. 14y. 6m. to 15y. 5m. as Roberts
and Mellone do for C.As. 14y. 3m. to 14y. 5m. (1952). Thus, it was considered
justifiable to use these corrections for test results in the present investigation
between C.A. 15y. 0m. and 15y. 5m. Above this age McNemar does not offer a
correction, and at the 16-year age-level S.D. drops to 16.5 (Terman and Merrill,
1937). Thus, for all test results above C.A. 15y. 5m., no correction was made.
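
The age rule described in the paragraph above can be sketched as a small decision function. This is only an illustration of the three age bands named in the text; the function name and the wording of the returned labels are assumptions made for this sketch, and no actual correction values are reproduced because the text does not give them.

```python
def correction_rule(years: int, months: int) -> str:
    """Return which I.Q. correction the Method section applies at a given C.A.

    The labels are illustrative; the correction tables themselves are not
    reproduced in the text and are not modelled here.
    """
    if not 0 <= months <= 11:
        raise ValueError("months must be between 0 and 11")
    age_in_months = years * 12 + months
    if age_in_months < 15 * 12:
        # Below C.A. 15 years: published R.M. corrections are used directly.
        return "R.M. correction for the child's own age band"
    if age_in_months <= 15 * 12 + 5:
        # C.A. 15y. 0m. to 15y. 5m.: corrections carried over from the
        # 14-year band, on the strength of the earlier 1942 table.
        return "R.M. correction borrowed from the 14-year band"
    # Above C.A. 15y. 5m.: no correction was made.
    return "no correction"


for age in [(9, 2), (14, 11), (15, 0), (15, 5), (15, 6)]:
    print(age, "->", correction_rule(*age))
```

Working in whole months keeps the band boundaries exact, so 15y. 5m. falls in the borrowed band and 15y. 6m. in the uncorrected one.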


Means and S.Ds. were computed on these I.Qs., and significances of differences
between means were obtained, using Sandler's ‘A Test’ (Sandler 1955),
which he claims has been ‘rigorously derived’ from Student's ‘t’ Test, and
corresponds to the two-tail form of ‘t.’
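
As a hedged illustration of how Sandler's statistic relates to the paired ‘t’, the sketch below computes A as the sum of squared differences divided by the square of their sum, and converts it to the equivalent t through t² = (n − 1)/(nA − 1). The difference scores are invented for the example and are not data from this study; the function names are likewise assumptions.

```python
import math


def sandler_a(differences):
    """Sandler's A: sum of squared differences over the squared sum."""
    total = sum(differences)
    if total == 0:
        raise ValueError("A is undefined when the differences sum to zero")
    return sum(d * d for d in differences) / (total * total)


def t_from_a(a, n):
    """Magnitude of the paired t that corresponds to A for n pairs."""
    denominator = n * a - 1
    if denominator <= 0:
        raise ValueError("n * A must exceed 1 (all differences identical?)")
    return math.sqrt((n - 1) / denominator)


def paired_t(differences):
    """Ordinary paired t, computed directly, for comparison."""
    n = len(differences)
    mean = sum(differences) / n
    variance = sum((d - mean) ** 2 for d in differences) / (n - 1)
    return mean / math.sqrt(variance / n)


# Hypothetical first-minus-last I.Q. differences (not the study's data).
diffs = [5, 8, -2, 6, 3, 9, 0, 7]
a_value = sandler_a(diffs)
print(a_value, t_from_a(a_value, len(diffs)), paired_t(diffs))
```

Smaller values of A correspond to larger values of t. If the figure printed in Table 1 is read as A = .0308 with N = 111, the same conversion gives a t of roughly 6.7, which would be in keeping with the reported significance level; that reading of the table is an assumption.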


III.—RESULTS AND THEIR TREATMENT.
A. Subjects. Of the 111 boys studied : 


(1) 69 per cent. came from homes in ‘urban’ areas, and 31 per cent. from
‘rural’ areas.

(2) 71 per cent. came from families where the father was ‘unskilled’ or
‘semi-skilled’ (Grades IV and V of the Registrar-General's categories of
occupation groups), as contrasted with 26½ per cent. in the general population
of Kent (H.M.S.O. 1954). This finding is in line with the results of earlier 
researches (Jones 1954).


(3) Only 29 per cent. came from families in the ‘higher’ occupation
groups (Registrar-General Grades I, II and III), as contrasted with 73.4 per
cent. in the general Kent population (H.M.S.O. 1954).


(4) 62 per cent. came from families where there were three or more child- 
ren, as contrasted with the normal 27 per cent. of families in Kent of this size 
(H.M.S.O. 1954). On the other hand, only 14 per cent. of our E.S.N. population
were ‘only’ children, as opposed to 42 per cent. in the general Kent population
(H.M.S.O. 1954). Eysenck (1953) also reports that dull children tend to come 
from large families. 

B. Test Results.

(1) For the population of E.S.N. boys tested, there is a statistically significant
‘deterioration’ in mean T-M (L) I.Qs. between the ages of 9y. 2m.
(before entry to residential special school) and 15y. 6m. (just before leaving
special school). (See Table 1).


TABLE 1

TEST-RETEST DATA: T-M (L) I.Qs. WITH R.M. CORRECTIONS APPLIED.

                          First Testing           Last Testing

Mean I.Q.                 71.0                    66.0
S.D.                      7.91                    9.43
Range                     47 to 96                46 to 93

Mean C.A.                 9y. 2m.                 15y. 6m.
S.D. of C.A.              1.72y.                  [illegible]
Range of C.A.             6y. 11m. to 13y. 4m.    14y. 0m. to 16y. 1m.

Difference between means  5.0 I.Q. points
Sandler's A               .0308
Level of Significance     Below .001
Product Moment ‘r’        +0.626





TABLE 2

ANALYSIS OF I.Q. CHANGES ON T-M (L) WITH R-M CORRECTIONS APPLIED.

Column headings: I.Q. Deteriorated; I.Q. Improved; Level of Significance.
Row headings: Total N=111; Below 60; ... ; 71 to 80.
[Cell values not recoverable.]

(2) The correlation coefficient (r = +0.626) is in general agreement with
the figure which one would expect after a test-retest interval of about 6½ years.
Eysenck (1953) and Stott (1960) report that the evidence from thirty similar
test-retest studies indicates a correlation of .79 after 4 years, with a drop of
about .04 points in the coefficient for every additional year intervening. On
these predictions, for a test-retest interval of 6½ years, one would expect ‘r’ to
be +0.69, whereas we find +0.626. This lower correlation may well be due to
the fact that many different school medical officers were involved in the testing. 
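
The prediction quoted in this paragraph is a simple linear rule, which the following sketch spells out. The function name and its parameter names are assumptions for illustration; the constants (.79 at four years, .04 lost per further year) are those given in the text, and the 6.5-year interval is the one that reproduces the expected figure quoted there.

```python
def expected_retest_r(interval_years, r_at_four_years=0.79, drop_per_year=0.04):
    """Expected test-retest correlation under the linear rule quoted in the text.

    The rule is only stated for intervals of four years or more.
    """
    if interval_years < 4:
        raise ValueError("the quoted rule starts at a four-year interval")
    return r_at_four_years - drop_per_year * (interval_years - 4)


predicted = expected_retest_r(6.5)
observed = 0.626
print(round(predicted, 2))             # 0.69
print(round(predicted - observed, 3))  # shortfall of the observed coefficient
```

On this rule the observed coefficient falls about .06 below expectation, which is the gap the paragraph attributes to the number of different testers.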


(3) From Table 2, it is clear that many more individuals show ‘deterioration’
than ‘improvement’ in I.Q. on final testing. A Chi² test, where ‘I.Qs.
constant’ were apportioned equally to ‘improved’ and ‘deteriorated,’ showed
that this greater number of deteriorating scores was significant at the .05 level.
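
The procedure in (3) amounts to a one-degree-of-freedom Chi² test of an even split, after the unchanged scores have been shared equally between the two directions. A minimal sketch follows. The counts are hypothetical, chosen only so that they sum to 111, because the cell values of Table 2 are not legible here; the function name is also an assumption.

```python
def chi_square_equal_split(deteriorated, improved, constant=0):
    """Chi-square (1 d.f.) against an even split of deteriorated and improved.

    Scores that stayed constant are apportioned equally to both groups,
    as described in the text.
    """
    d = deteriorated + constant / 2
    i = improved + constant / 2
    expected = (d + i) / 2
    return ((d - expected) ** 2 + (i - expected) ** 2) / expected


CRITICAL_05_1DF = 3.841  # standard .05 critical value for one degree of freedom

# Hypothetical counts, NOT the figures from Table 2.
chi2 = chi_square_equal_split(deteriorated=62, improved=39, constant=10)
print(chi2, chi2 > CRITICAL_05_1DF)
```

With two categories the statistic reduces to the squared difference between the two adjusted counts divided by their total, so it depends only on how far the split departs from half and half.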


(4) If one sub-divides the data, however, into I.Q. ranges, it can be seen 
that there appears to be a ‘cut-off point’ around I.Q. 60 on first testing. 
Those boys with an initial I.Q. of below 60 on first testing, tend to show an 
improved I.Q. (more than 3 points) on final testing. Those boys with an I.Q. 
of more than 60 on first testing tend to show a deterioration of I.Q. on final 
testing of more than 3 I.Q. points. 


These differences in the direction of I.Q. change appear to be statistically 
significant, and greater than one would expect by chance alone. It may be that 
they are partially explicable by the concept of ‘statistical regression towards the
mean’ (McNemar 1940 and Clarke, Clarke and Reiman 1958), since they occur
in different directions in the two extreme groups, and towards the group means.
The fact that the group mean itself changes between first and last testing
cannot, however, be explained on these grounds, as it changes in the ‘wrong’
direction, i.e., away from the mean score of the population.

IV.—EXPLANATORY HYPOTHESIS. 
A.—Procedure and Findings. 

In a private communication, Professor A. D. B. Clarke suggested that the
most likely reason for the decline in scores was that “the test material is more
verbally biassed on retest (Mean C.A. 15.5y.), than on initial test” (Mean C.A.
9y. 2m.). His argument is based on the hypothesis that “the increased verbal
weighting at the higher age levels is responsible for decreasing the final I.Q.,
since commonly verbal assets are more seriously impaired in E.S.N. children
than are performance assets.” (See also Tansley and Gulliford 1960).


In order to test this hypothesis, the sub-tests of T-M (L) were classed as
either ‘Verbal’ (V) or ‘Non-Verbal’ (including ‘Arithmetical’) (P) on the
basis of the classification of test responses published as Appendix I in Kennedy-
Fraser (1948). The original test sheets of fifty-one boys which were immediately
available were then re-examined, and each sub-test attempted at either first or
final testing was labelled ‘V test’ or ‘P test.’ Totals and Chi² of their differences
are shown in Table 3.


B.—Indications and discussion. 

(1) On Verbal Tests there is no statistically significant difference between
success rates on first and final testings. The direction of the change, however,
is towards an improvement in ‘verbal efficiency.’

(2) On Non-Verbal Tests there is no statistically significant difference
between success rates on first and final testings. The direction of the change,
however, is towards a deterioration in ‘non-verbal efficiency.’

TABLE 3

DIFFERENCES BETWEEN VERBAL AND NON-VERBAL TESTS ON T-M (L) WITH R-M
CORRECTIONS APPLIED.

(N Boys=51)

VERBAL TESTS
                     Attempted    Percentage Passed    Chi²
First Testing        927          38%                  [not recoverable]
Final Testing        1580         43.6%

NON-VERBAL TESTS
                     Attempted    Percentage Passed    Chi²
First Testing        519          67.3%                2.80, P=.10
Final Testing        328          58.5%                (NOT significant)

FIRST TESTING
                     Attempted    Percentage Passed
Verbal Tests         927          38%
Non-Verbal Tests     519          67.3%

[Heading and counts of the last block not recoverable]
Verbal Tests and Non-Verbal Tests: Chi² 12.71, P=less than .001 (Highly
Significant)

(3) On both first and final testing there is a consistent and highly significant
difference between success rate on Non-Verbal tests and on Verbal
Tests. E.S.N. boys do better on P tests than on V tests from ages 8 and 9 years
up to ages 15 and 16 years. Generally speaking, they passed only four out of
ten V tests, but six out of ten P tests.

Is Clarke’s hypothesis that “the increased verbal weighting at higher age
levels is responsible for decreasing the final I.Q.” supported?

The experimental data show that on first testing, out of a total of 1,446 
sub-tests attempted, 64 per cent. were V tests, while only 36 per cent. were P 
tests. On final testing, however, out of a total of 1,908 sub-tests attempted, 
83 per cent. were V tests, and only 17 per cent. P tests. Since there is no 
statistically significant difference on V tests between success rate on first testing
and on final testing, let us presume that there is no difference at all. We may
assume that on V tests the success rate is 40.8 per cent. (mean of 38 per cent. and
43.6 per cent., the actual success rates achieved). Similarly for P tests we
assume a success rate of 62.9 per cent. (mean of 67.3 per cent. and 58.5 per cent.,
the actual success rates obtained).

Let us next presume that on first testing 100 tests were attempted; of these
sixty-four would be V tests and thirty-six P tests. Applying the differential
success rates just calculated, this would mean that twenty-six V tests would be
passed, and 22.6 P tests would be passed. Total tests passed on first testing
would thus be 48.6.

If we then presume that on final testing, 100 tests were attempted, of which
eighty-three would be V tests and seventeen would be P tests, applying the
differential success rates again, this would result in thirty-four V tests being
passed, and 10.7 P tests. The total tests passed on final testing would, therefore,
be 44.7 as opposed to 48.6 on first testing.

Thus, if the actual number of tests attempted on first and last testing was 
held constant at 100, then considering the different proportion of V and P tests 
attempted, there would, on balance, be an apparent deterioration in the numbers 
of sub-tests passed, given no change at all in ‘efficiency level’ (i.e., in
success-rate) from one testing to the other.

In this hypothetical case then, it would appear that Clarke's hypothesis is 
supported, i.e., that the increased verbal weighting on final testing is responsible 
for decreasing the final score. 
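
The hypothetical case worked through above can be re-run in a few lines. This is a sketch under the text's own assumptions (constant success rates of 40.8 per cent. on V tests and 62.9 per cent. on P tests, and 100 tests attempted on each occasion); the function and variable names are invented for the illustration.

```python
# Constant success rates assumed in the text: the means of the first and
# final percentages for each kind of sub-test.
V_RATE = (38.0 + 43.6) / 2 / 100   # 40.8 per cent.
P_RATE = (67.3 + 58.5) / 2 / 100   # 62.9 per cent.


def expected_passes(n_tests, verbal_share, v_rate=V_RATE, p_rate=P_RATE):
    """Sub-tests passed when only the verbal/non-verbal mix changes."""
    verbal = n_tests * verbal_share
    non_verbal = n_tests - verbal
    return verbal * v_rate + non_verbal * p_rate


first = expected_passes(100, 0.64)   # 64 V tests and 36 P tests
final = expected_passes(100, 0.83)   # 83 V tests and 17 P tests
print(round(first, 1), round(final, 1), round(first - final, 1))
```

Carried through without intermediate rounding, the totals come to about 48.8 and 44.6, a shade different from the 48.6 and 44.7 quoted in the text, which rounds the V-test figures to whole tests before adding. The direction and approximate size of the apparent decline are unchanged.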

If we now consider the actual ‘raw scores’ from the experiment, we find
the success rates for V tests and P tests combined, to be 48.5 per cent. on first
testing, and 46.2 per cent. on final testing. Chi² of this difference is 0.93, which
is not statistically significant.
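
The combined success rates, and the verbal shares of 64 and 83 per cent. quoted earlier, can be recovered from the attempts and pass percentages given in the text and in Table 3. The sketch below does this; the data layout and function name are assumptions, and the pass counts are approximated as attempts multiplied by the reported percentages, since the raw pass counts are not given.

```python
# Attempts and reported pass rates for V (verbal) and P (non-verbal) sub-tests.
ATTEMPTS = {
    "first": {"V": (927, 0.380), "P": (519, 0.673)},
    "final": {"V": (1580, 0.436), "P": (328, 0.585)},
}


def summarise(testing):
    """Total attempts, verbal share and overall success rate for one testing."""
    cells = ATTEMPTS[testing]
    attempted = sum(n for n, _ in cells.values())
    passed = sum(n * rate for n, rate in cells.values())  # approximate counts
    return {
        "attempted": attempted,
        "verbal_share": cells["V"][0] / attempted,
        "overall_rate": passed / attempted,
    }


for testing in ("first", "final"):
    s = summarise(testing)
    print(testing, s["attempted"],
          round(100 * s["verbal_share"]),
          round(100 * s["overall_rate"], 1))
```

The overall rate is a weighted average of the two success rates, so it moves with the verbal share even when neither rate changes much, which is the point of the argument.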

These percentages are similar to the proportions passed in the hypothetical
case. The fact that in the real experiment the percentage of all sub-tests passed
on final testing is higher than that in the hypothetical case, must be due to the
slightly improved ‘verbal efficiency’ on final testing (see Table 3). Eighty-three
per cent. of all the sub-tests on final testing were verbal, and the improved
‘verbal efficiency’ on these outweighs the influence of the deteriorated
‘non-verbal efficiency’ on the remaining 17 per cent. of tests taken.


C.—General discussion. 

This ‘over-weighting’ of T-M (L) with verbal sub-tests from Yr. VII level
and upwards, has been criticized elsewhere (Clarke and Clarke 1958 and Bull.
Brit. Psychol. Soc. 1958). The results of the present investigation would tend
to support these criticisms, though not at an acceptable level of statistical
significance. The fact that T-M (L) ‘markedly penalizes’ the dull person,
especially if he is from ‘poor or adverse surroundings,’ would make it an
unsuitable measuring instrument for use in the testing of E.S.N. children, since
although it ‘indicates his verbal handicaps’ effectively, it fails to give us an
all-round assessment of his wider intellectual capacities.

In the present study, Roberts-Mellone corrections were used throughout,
a procedure which has been confirmed empirically since the publication of their
original paper (1952), following McNemar (1942), by Collman and Newlyn
(1958), by Hiskey and Sadnavitch (1958), by Scarr (1953) and by Portwood
(private communication). Such evidence would appear to outweigh the arguments
of Stott (1960), which are not always clear; nor are his basic assumptions
universally acceptable.

Some of Stott’s findings (e.g., greater deterioration in the scores of girls
than of boys) suggest that the consistent deterioration which he finds may be
due partially “to real changes in the rate of mental development” of such
children, yet it is notable that he concludes, “this is not to deny that part of the
variation is also due to faulty standardization.” Present results would suggest
that the deterioration is partially due to the actual structure of the test, which 
varies in its composition from one mental age level to another, and tends to 
lower the scores of those children with comparatively poor verbal ability, in 
direct proportion to the verbal weighting in the selection of sub-tests used at 
any one testing. 

Stott’s (1960) paper fails to supply adequate information about the
chronological ages of the children tested, about their mental ages, or about the
average re-test interval for the children re-tested after 3 years 5 months. The
present study offers data on a more homogeneous population, with a test-retest
comparison between scores at junior school age, and then nearing school-leaving
age. It is notable that in some of the studies discussed above, whether or not
R.M. corrections were applied, there is a greater deterioration in test scores,
the greater the interval between test and re-test. In the present study, where the
test-retest interval is longer than in most, there is a greater deterioration, which
suggests that the same aetiological factors may well be cumulatively operative on
all the subnormal populations studied.

Various theoretical issues remain to be considered. The work of the Clarkes
(1958a, 1958b and 1959), of Mundy (1957), and of Penrose (1949) strongly
suggests that for the sub-group we have been studying, there is a relative
constancy of ‘mental efficiency’ (and thus of I.Q.), during childhood, but that
after the age of about 15 years, there are often marked improvements in I.Q.
Clarke's (1959) hypothesis is that such increments amongst the ‘feeble-minded’
represent “not a response to present conditions, but rather a recovery from
early deprivation,” maternal or environmental. This work by no means contradicts
the well-known thesis of Bowlby (1951 and 1956) on deprived children,
whose follow-up studies have, unfortunately, as yet all been on children below
the age of 14 years; it merely has a bearing on the long-term outcome.

O'Connor (1956) points out that the concept of deprivation is still confused,
but that one important variable in the normal development of abilities and
character is a ‘stimulating environment.’ This point of view is supported by
Sontag, Baker and Nelson (1958). They found that amongst pre-school children
‘emotional dependence on parents’ was associated with deteriorating I.Qs.
during that period; such children were establishing behaviour patterns which
were not conducive to ‘learning to learn’ during these early years. On the other
hand, pre-school children who developed behaviour patterns characterized by
aggressiveness, self-reliance, initiative and competitiveness (those from
‘stimulating environments’), were developing personality traits which result
during school life in gradually improving I.Qs. Such children seem to develop
a high need for achievement, which operates as a motive for learning.

In the present paper it is established that E.S.N. boys from ‘working class
homes’ (i.e., from the two lowest social-occupation groups, from which 71 per
cent. of our population were derived), score consistently better on the
non-verbal tests in T-M (L) than on the verbal tests. Bernstein (1958 and 1959)
found that of 309 G.P.O. messenger boys in adolescence from ‘unskilled and
semi-skilled backgrounds,’ 206 scored Matrices I.Qs. of 101+ while the means
of their Mill Hill Vocabulary scores fell below this.

It has been argued by Hollingworth (1920) and others that in subnormal
children the curve of ‘mental development’ rises constantly, but more slowly
than in normal, and that there is a cessation of growth at an earlier age than for
normal children. If this is so, and the curve of mental development is ‘less
than linear,’ then the I.Q. of subnormal children must inevitably fall. Some
support has been given to this hypothesis by earlier investigators such as
Phillips (1940). In more recent studies, however, where the varying S.Ds. of
T-M (L) have been corrected for, no such deterioration has been found, e.g.,
Collman and Newlyn (1958), Hiskey and Sadnavitch (1958) and Portwood.
We have seen too in the work of the Clarkes that marked rises in I.Q. level may
occur at adolescence; and Davis (1957) has challenged the earlier assumption,
arguing that environmental influences are extremely important in influencing
the course of ‘mental development.’

The influence of environment may also be important in determining the
significant differences found in the present study between the efficiency of
E.S.N. boys on non-verbal and verbal tests. We know that they tend to come
from large ‘working class’ families, where parental training, encouragement and
concern tend to be lacking, and where other investigators have found children
with poor verbal and academic abilities (Nisbet 1953, Anastasi 1956). Such
children, like adolescent delinquents (Glueck and Glueck 1950), tend to do
relatively well on ‘non-verbal tests’ whether they are of a performance or
perceptual type. Pringle and Bossio (1958) have presented evidence that groups
of deprived children obtain lower Verbal than Performance W.I.S.C. I.Qs.
Tansley and Gulliford (1960) report that E.S.N. children in a residential special
school tend to score higher W.I.S.C. Performance than Verbal I.Qs. These
writers suggest that the differences may relate to the fact that “many of their
children come from broken or unstable homes, which do not provide the best
conditions for language development.” In this connection, Bernstein's
hypotheses (1958 and 1959) offer a possible explanation of the underlying causes.


In view of the recent appearance of Form L-M (Terman and Merrill 1960),
an appraisal was made of the relevance of the present study to the new form,
which seems likely to supersede Form L eventually. As in Appendix I of
Kennedy-Fraser (1948), and as in the present study for T-M (L), the sub-tests of
the new Form L-M were categorized as either ‘Verbal’ or ‘Non-Verbal’
(which includes ‘Arithmetical’), on the basis of the kind of test
responses elicited.

Kennedy-Fraser (1948) reports for T-M (L) the following percentages of
‘Non-Verbal’ sub-tests (including Arithmetical), for M.A. groups Yrs. II–V,
Yrs. VI–VII, and Yrs. VIII–S.A.3: 59 per cent., 19 per cent. and 26 per cent.,
respectively. For the same three M.A. groups, on Form L-M, the percentages
of ‘Non-Verbal’ sub-tests were found to be 61 per cent., 16 per cent. and 30 per
cent., respectively. Thus, as far as the ‘Verbal’–‘Non-Verbal’ dimension in
the composition of the new Form L-M is concerned, there would appear to be
very little difference from T-M (L), and, in fact, many of the sub-tests are the
same. The conclusions reached in the present study would, therefore, appear
to apply in general to Form L-M.




V.—CONCLUSIONS AND RECOMMENDATIONS. 


(1) E.S.N. boys between 7 and 16 years score better on the Non-Verbal 
than on the Verbal sub-tests of T-M (L). This is in line with earlier findings on 
the relative efficiency of intellectually inferior groups on verbal and performance 
tests—E.S.N., deprived and delinquent children and adolescents. This may well 
be related to the sociological findings that 

(2) E.S.N. boys in this Kent residential school tend to come from homes
where the father is unskilled or semi-skilled, i.e., from the lowest two occupation
groups; and from larger families of three or more children.


(3) On this experimental population, there are tentative indications that
between the mean ages of 9¼ years and 15½ years, using T-M (L) tests, ‘verbal
efficiency’ improves somewhat, while ‘non-verbal efficiency’ deteriorates a
little. Despite these findings and despite the fact that Roberts-Mellone
corrections were used throughout, there was a statistically significant decline
in I.Q. as measured by T-M (L).


(4) This finding may be explained partially at least by Clarke's hypothesis,
that it is due to “the increased verbal weighting at the higher age-levels”
(83 per cent. as opposed to 64 per cent.), “since verbal assets are more seriously
impaired in such children than are performance assets.” This is not to deny,
however, that other aetiological factors may be operative in causing such a
deterioration. Although an attempt has been made by use of the R.M. correction
table to correct for the grosser errors of standardization, the factor stressed
by Stott (1960), “real changes in the rate of mental development,” due possibly
to the cumulative effects of a culturally deprivatory environment, or to lowered
motivation, or to a lowered level of “mental organization and
conceptualization,” may also be important.


(5) Whatever the causes, T-M (L) would appear to be a misleading test
to use in the assessment of E.S.N. children. Although it is a predominantly
verbal test, and the proportion of verbal items passed improves slightly between
9¼ and 15½ years, yet its structure is such that the I.Q. appears to decline.
It would appear inappropriate for the dull child from a poor or detrimental
environment; it fails to give an all-round assessment of intellectual capacities;
and the different types of items are inconsistently and unequally distributed
through the different mental age levels of the scale.


(6) In view of all the difficulties mentioned in this paper, the test under 
question would appear to need very careful interpretation when applied to an 
educationally subnormal population, especially if changes in cognitive level were 
being considered. 


(7) The most efficient type of test for use in the assessment of ‘dull’
children would appear to be one where verbal/educational and performance/
[illegible] items were equally balanced, where separate verbal and performance
I.Qs. were obtainable, where all the established ‘group factors’ were tapped
equally and were measurable quantitatively, and where the different types of
items were consistently distributed through all age levels of the scale. (A recent
attempt has been made by McCarthy and Kirk (1961) to devise a series of tests
which analyze the specific psycho-linguistic abilities and disabilities of
subnormal children, so that detailed diagnosis can lead to effective remedial
treatment. It is as yet too early to judge the effectiveness of such a differential
diagnostic instrument, but the question is considered by Cashdan (1962).)


(8) The problems discussed in this paper might well be elucidated, were
it possible for some investigator to do a comparable test-retest study on
children at the other extreme of the intelligence range, the ‘educationally
superior’ population. Valuable confirmatory evidence for the main hypothesis
of this paper might also be educed, were some of the earlier studies quoted
(Phillips 1940, Thorndike 1940, Scarr 1953, Collman and Newlyn 1958, Hiskey
and Sadnavitch 1958, Portwood (private communication) and Stott 1960),
subjected to the same sort of statistical analysis.

(9) There would appear to be little difference in the ‘Verbal’–‘Non-Verbal’
composition of T-M (L) (1937), and Form L-M (1960). Thus, the above
conclusions and recommendations would apply equally to the new Form L-M
as to Form L.


THE DEVELOPMENT OF CHILDREN’S IDEAS ABOUT
COUNTRY AND NATIONALITY 


Part II: NATIONAL SYMBOLS AND THEMES 


By GUSTAV JAHODA 
(University of Glasgow)


Summary. The learning of conventional national symbols (national anthem, 
songs, flags) and the understanding of their significance at various stages were 
studied. A specially devised picture test was used to explore more subtle 
aspects of national consciousness. The broad developmental trends and the 
socio-economic group differences emerging from both parts of the investigation 
are outlined, and some educational implications suggested. 


I.—INTRODUCTION. 
IN the literature of the social sciences a good deal of lip-service is paid to the 
importance of national symbols in the building up of national sentiments. 
For instance, G. W. Allport (1950) wrote as follows: “As a rule, personal
loyalty can adhere to an abstraction only when the abstraction is richly
symbolized. Christianity rivets attention upon the cross, nations focus on their
respective flags.” Few attempts seem to have been made to trace the acquisition of
such symbols in young children, apart from the work of Horowitz (1940) and
recently Lawson (1963), who concentrated on national flags. The aim of the
present study is to initiate the exploration of a somewhat wider range of
symbols.


II.—THE NATIONAL ANTHEM.

Six different tunes were played to the children on a tape-recorder in a 
systematically rotated order. Two of them were Scottish, two Irish, one English, 
and the National Anthem. They were asked to say of each if they had heard it 
before ; if recognition was claimed, the following questions were asked : Where 
did you hear it? Do you know what it is? Which country does this tune 
belong to? Here only results relating to the National Anthem will be given. 

Table 1 shows the different kinds of responses. About one-quarter of the 
youngest children gave no sign of recognition. The second group said they had 
heard it before, but were either unable to specify the country, or gave absurd
answers like “From the country where Herod is” (note the tense), or even
“It’s from Wagon Train.”

TABLE 1

RESPONSES TO HEARING NATIONAL ANTHEM.

Row headings:
Fails to recognize tune
Claims recognition but does not know country, or absurd response
Claims recognition and attributes to Scotland
Ditto and attributes to London or England
Ditto and attributes to Britain
Per cent. unable to give title of tune
[Column headings and cell values not recoverable.]

At this point attention may be drawn to the bottom row of the table,
indicating the frequency of ignorance of the name. For this purpose only
“God Save the Queen” or “National Anthem” were accepted; of those
rejected, five implied awareness that the tune was linked with the monarchy,
such as “The royal song” or “The Queen's music.” Beyond the age of 7,
inability to name the tune was rare, and in general it was found more often
among W children (P less than .02).


Turning now to a consideration of errors of attribution, one finds two major 
trends. The first, most common with W children, was to claim the anthem for 
Scotland. The explanation which suggests itself is that these children are 
unlikely to have attained a conception of Britain as a whole, and a check of
G-stages supports this: 76 per cent. of them were at or below G.III, as compared
with only 30 per cent. of those answering correctly (P less than .001). The causes
of the second error, namely mentioning England or London, are less evident.
In some cases at least, judging from the replies, it seems to have arisen from an
association in the children's minds between the Queen, her residence in London,
and ‘her music.’

The pattern of development that emerges from the comments (though
owing to limitations of time it was not possible to pursue it systematically)
may at least be broadly sketched. At earlier ages, the National Anthem tends
to be just one particular piece of music among others. The name is largely
unknown, but it is connected with such occasions as school ceremonials, Church
and Sunday School (e.g., “We get it at concerts”; “In the service in the hall,
the big ones know the words”; “It’s a hymn”); other sources include the
wireless and, especially among W children, television.* These trends continued
at 8-9, but there were some additions. For M children, these consisted mainly
of Cubs' or Brownies’ displays, with W ones ‘the pictures’ or miscellaneous
occasions like “Away up the town when the band was playing.” Moreover,
the name then being generally known, a definite link with the idea of the Queen
had become established. In the case of most of the older (10-11) M children,
the tune had already taken on the character of a national symbol (e.g., “Well,
that can be sung by people all over the British Isles”); with the corresponding
W group, such progress tended to be much slower, as illustrated by the examples
below:

It was on the T.V., when they stand up and sing. (Why do they stand 
up?) Don't know. (W10). 

They play it at the start of an international game or something. It 
could be English, Scottish—anything. (W11). 

Thus, there is a slow and almost imperceptible transition in the meaning of 
this music for the children. At first it may be quite unfamiliar or, if recognized, 
quite wrongly labelled. Later, it comes to be connected first with the person of 
the Queen, and then with Britain as a whole; at that stage it was almost 
invariably correctly named. It was striking to observe how older children, who 
had already grasped this, responded to the questions with a contemptuous 
laugh, because they felt the answers were so obvious; one would hardly have 
guessed that their insight is likely to have been a fairly recent acquisition.


III.—NATIONAL SONGS.
Whilst the National Anthem is confined to more formal or ritual occasions, 
there are many popular tunes tinged with patriotic sentiments. Furthermore,


* One-third of the 7-year-old W children were able to report that “ It's on the T.V. 
when it's finished.” 
by contrast with the National Anthem, such tunes tend to be associated with
the various units which together constitute the United Kingdom. The tracing
of development in this sphere was the object of putting the following question:
Many people from this country go to live in far-away places and are a long way
from home. Now imagine they would like to hear three songs: which ones do
you think they would choose?


Answers were divided into three categories. The first consisted of those 
unable to give the name of any song, together with those who produced irrelevant 
ones. For all these the question effectively meant “Tell me the names of 
three songs.” The youngest children in this category came out mainly with 
religious songs (“Gentle Jesus,” “Jesus loves me,” “Away in a manger,” 
etc.). At a later age, irrelevant songs tended to become increasingly secular, 
and some divergence between M and W emerged (“I saw Mum kissing Santa 
Claus” or “Jailhouse Rock” versus “Coppelia” or “An adult might like 
opera”). Also included in this category, because their number was relatively 
small, were what might be called transitional answers. In these, the child 
seemed to have some inkling of what was required, but was as yet unable to 
bridge the gap imaginatively by putting himself into the position of someone 
far from home. Examples of this would be children who suggested such people 
would wish to hear Italian, Indian or other foreign songs; one boy said: “We 
might go to France next year, so I think they should be French ones”; in 
Piaget's terms, this would be a typically egocentric statement. All the responses 
so far are grouped under “Nil or irrelevant” in Table 2. 


TABLE 2 


SONGS FROM HOME. 


                                            M Schools             W Schools 
                                         6-7   8-9   10-11     6-7   8-9   10-11 

Nil or irrelevant response 
Understood and produced none or one 
Understood and produced two or three 





The second category was composed of children who clearly understood 
what was required (“They will want to hear a Scottish song,” or words to that 
effect), but either could not think of any Scottish song at all, or were able to 
name only one. There was a small sprinkling of correct non-Scottish answers 
such as “Carry me home to Blighty,” which were, of course, credited. The third 
category was the same, except that the children knew two or three songs.* 


One noteworthy feature of this question is the fact that it is one of the few 
which failed to yield any significant difference between M and W. From this, it 
may perhaps be inferred that the establishment of an effective association 
between native songs and the idea of the homeland is largely independent of 
differences in both intelligence and social background. It would seem that 
conceptually the presence of a global, diffuse and undifferentiated notion of, in 
this case, Scotland is sufficient for the formation of the link. Thus, none of the 


* Those most frequently mentioned included: Auld Lang Syne, I belong to Glasgow, 
Scotland the Brave, Roamin' in the gloamin’, Loch Lomond, A Gordon for me, Song of the 
Clyde, A hundred pipers, Ye banks and braes, My heart's in the Highlands. 
G.I children understood the problem; however, the superiority of G.III over 
G.II was slight and not significant (P between .30 and .20). It is also likely that 
children in both kinds of environments are about equally exposed to patriotic 
songs, learning them at roughly the same rate. 


IV.—THE UNION JACK. 


Nine coloured cards picturing, among others, the Union Jack and the 
Stars and Stripes, were spread out in front of the children. At the outset they 
were told to point out those they had never seen before, and these were removed. 
Next, they were asked the names of the remainder, and the countries to which 
they belonged. Finally, children unable to comply were informed that some 
of the flags were Scottish, with the request that they should try and pick them 
out. The last move turned out to have been a mistake, hence, unfortunately, 
no useful material was obtained regarding the Lion Rampant and St. Andrew's 
Cross.* The report on the findings will, therefore, be centred on the Union Jack. 


Results are set out in Table 3. First of all, there were two 6-year-old W 
children who placed the Union Jack among those they had never seen: they 
are included in the top row. It will be noted that the children were generally 
able to name the Union Jack correctly before attributing it to Britain as a whole 
rather than claiming it for Scotland only. The M children were significantly 
more successful on both counts (P less than .01). 


TABLE 3 
THE UNION JACK. 








24 24 
96%, 29% 





Number of children able to identify 
U.S.A. flag 


22 4 





The youngest children were apt to link the flag with concrete situations in 
their spontaneous comments, as shown below : 
It’s the flag you always put up when the Queen comes. (M6). 
That’s the one you get at the circus—my daddy buys it for me. (What 
country?) It’s the Glasgow flag. (W6). 
The Union Jack was often called “the Queen's flag,” indicating an association 
already seen in connection with the National Anthem. Beyond the age of 7, 
grosser mistakes were mainly confined to W children; apart from calling a 


* This was because of a pervasive tendency to call the whole of the remainder “Scottish,” 
whereby they were bound to score by chance alone. 
variety of different flags “Union Jack,” they sometimes used the expression in 
a completely uncomprehending manner : 
I think that's the Jack flag. (What country?) Don't know. (W9). 
Yes, that’s the Union Jack. (What country?) Is it France? (W9). 

In contrast to this, the bottom row of Table 3 shows that W children did 
not do significantly worse than M ones in attributing the Stars and Stripes to the 
U.S.A. It might be added that a good many more recognized the flag without 
being able to specify the country. Here is a revealing instance : 

That’s General Custard’s flag; I’ve seen him on T.V.; he fights the 
Indians. (W7). 

Lastly, a few impressionistic comments will be offered regarding the 
Scottish emblems. They appeared familiar to many children, though few even 
among the older children were able to name them correctly. The pervasive 
influence of television again seems to have made itself felt, as several children 
mentioned it in connection with the Lion Rampant, this being the symbol of 
Scottish Television. Some of the M children who did best explained that they 
had learnt it with the Cubs or Brownies. On the whole, there was little evidence 
that, within the age-range studied, the Scottish emblems functioned as effective 
symbols of Scottish nationality. 


V.—RECOGNITION OF SCOTTISH PICTURES. 

This procedure was devised with the object of assessing some of the subtler 
aspects of children’s knowledge of their country. They were presented with 
sets of three pictures, reproduced in Plates I-V, and required to indicate for 
each set which—if any—of the pictures is connected with Scotland. 


PLATE I—COSTUMES 

PLATE II—LANDSCAPES 

PLATE III—BUILDINGS 

PLATE IV—EMBLEMS 

PLATE V—PORTRAITS 

Table 4 gives the results in a somewhat different form, being confined to the 
number of successes for each sub-group of children. No correction was made for 
possible chance successes, as it was evident from the children's comments that 
their number was negligible. The first set, mainly intended as an encouragement, 
dealt with dress and costumes. The great majority of children had no trouble 
with this. With the landscape set, where a typical Scottish scene had to be 
picked out, W children were conspicuously less successful (P less than .001). 
The difference probably reflects differential opportunities for travelling around 
the countryside, and possibly also differences in the extent to which landscapes 
decorate their homes, though the latter is entirely speculative. Similar 
considerations would apply to the next set, three buildings including Edinburgh 
Castle (P less than .02). It is noteworthy that several M children were able to 
identify Notre Dame in Paris, one of the ‘distractors.’ 


After this came a set of conventional emblems, and here the rates of success 
in singling out the ubiquitous Scottish thistle differed even less, though still 
significantly (P less than .05). The last set, featuring “Rabbie” Burns in 
company with Shakespeare and Voltaire, produced a surprise. There was no 
significant M-W difference, either in correctly choosing Burns as the Scottish 
one, or even in knowing his name. Evidently Burns constitutes one of the most 
potent Scottish national symbols, as early as childhood. Below are some 
revealing answers to the question “How do you know this is Burns?”: 


My grannie's got a picture of him. (M8). 
Because we have a calendar with his face on it. (M7). 


When your mummie's in hospital you go into a fish-and-chip shop for 
your dinner; and when I looked up I saw a picture of him looking at me. 
(W6). (N.B.—This boy recognized neither the National Anthem nor the 
Union Jack.) 


On the T.V. news—we saw his cottage. (W7). 
It would seem that Burns' presence in Scottish life cuts across the barriers 
of social class, and that is no doubt how he would have wished it to be. 


TABLE 4 


RECOGNITION OF SCOTTISH PICTURES. 


(Numbers of children out of 24 who succeeded on each task.) 
VI.—ATTITUDES TO OTHER COUNTRIES AND PEOPLES. 


The findings relating to this are reported in detail elsewhere (Jahoda, 
1962), and this section, therefore, offers merely a sketch of broad trends and 
some features of special interest in this country. 


The children's growing awareness of more distant nations began with casual 
glimpses of unusual picturesque detail divorced from any general background. 
This was sometimes combined with concrete and homely, though equally 
fragmentary information gained through such personal channels as relatives 
abroad. The subsequent process of differentiation and organization, too com- 
plex for brief summary, was studied in some detail in relation to America and 
Russia. 

Attitudes tended to follow a distinct pattern. Those of the youngest and/or 
least mature children were determined by some arbitrary and usually concrete 
detail that happened to have created an impression (e.g., “Don't like New 
Zealand because the butter is too salt”). Actually, these are probably too 
evanescent to deserve the term ‘attitudes,’ and are mentioned only because 
they are characteristic of an early mode of thought. Apart from the attraction 
of ‘holiday countries,’ the more stable sources of favourable attitudes seemed 
to be again the existence of personal ties, or a belief in the similarity of the 
people to oneself, resulting in a feeling of affinity ; this last statement, it should 
be added, is somewhat speculative and could not be directly verified from the 
data. Unfavourable attitudes, from a surprisingly early age, have their roots 
in past wars, but among the oldest or most mature children these are overshadowed 
by present political conflicts. 


One of the salient, and somewhat disturbing, findings was the ethnocentricity 
exhibited by some W children. This took the form of colour prejudice, and a 
marked tendency to look down on people in other countries. It is interesting to 
note, however, that their attitude to the English appeared to be influenced by 
considerations of social class. Thus, a higher proportion of W as compared with 
M children thought the English ‘nicer’ in behaviour (e.g., “In London the 
men are not so rowdy when they're drunk”); moreover, they tended to 
identify an English accent with middle class speech, and to regard the Scottish 
tongue as a working class one, as may be illustrated by two examples : 

Well, the English say “Good jolly, this is a jolly fine day.” The 
Scottish say “Aye, it's a grand day.” The English are a lot of toffs. (W9). 

They talk different—kind of polite, kind of Cockney... (W11). 

Lastly, a few general remarks on attitudes towards the English. From the 
way the children talked about them (and it must be remembered that this will 
largely reflect the expressed opinion of their elders) it would seem that the 
traditional antagonism is very much on the wane. There were hardly any 
disparaging remarks about the “Sassenachs”*, and when questioned about 
dislikes, not a single one mentioned England or the English. 


VII.—CONCLUSIONS. 

The outstanding impression one gains from surveying the children's 
development, in line with the conclusions of investigators concerned with other 
areas of children's thinking, is that of the great intellectual distance traversed 
within the span of a few years. At the outset, their responses reveal an almost 
complete ignorance about the wider geographical and social world surrounding 


* Asked to give another name for the English, only three M children were able to 
produce this ancient word. 
them; some five years later the outlook of most of them is no longer fundamentally 
different from that of mature adults. During the intervening years, 
they have acquired a set of conceptual tools enabling them to organize their 
environment meaningfully and to make the main distinctions conventional in 
our society. 

The details of this process have been described in the present study. However, 
it was necessary to treat the various aspects more or less in isolation, 
leaving aside the question as to whether there may not be some common thread 
running through the advances in seemingly divergent spheres. An analysis 
carried out with the material as a whole indicates that the central element is 
probably to be found in the extent to which the child has succeeded in differ- 
entiating and ordering ever-widening sections of his geographical environment ; 
and this can be at least roughly assessed in terms of the G-stages. It would 
appear that the particular G-stage reached is an index of the limit to the amount 
of progress possible elsewhere ; children often lag behind, failing to realize their 
potential achievement, but there is no indication that they can exceed it in 
genuine understanding, as contrasted with mere parrot recital. 

Among the youngest children investigated, nearly nine-tenths had not 
advanced beyond G.II. It is thus legitimate to generalize about the age-group 
6-7, because their outlook was fairly uniform. After the age of 7, the increasing 
diversity of G-stages precludes any general statement, which would be mislead- 
ing. 

It may be recalled that G.I denotes a perspective confined to the child's 
immediate surroundings: the home, school and play streets in which he spends 
his everyday life. With G.II an extension has taken place, involving some 
apprehension of Glasgow as a unitary whole, to which the child refers himself 
and of which he feels himself to be a part. As yet, however, no conception has 
been formed about what lies outside. From this starting-point, an inventory of 
the ideas characterising a majority of the children aged 6-7 will be made. 


First of all, the meaning of even the simplest geographical words tends to 
be extremely vague, or altogether mistaken. Thus ‘ town’ is liable to be applied 
to any aggregate from streets upwards. This reflects not only inadequate 
command of this particular concept, but at the same time also inability to 
classify known geographical names in any coherent order; both are really 
two sides of the same coin. Asked “What is your country?” the dominant 
answer was “Glasgow”; one replied “I don't live in a country, I live in a 
close.”* Whilst they have heard the word “Scotland,” and if in G.II usually 
realize that it is a place of some kind, they cannot locate it properly in space; 
it is thought of as being “ away " somewhere. Requests to say something about 
Britain elicited a collection of absurdities. The most they achieved, with three 
exceptions, was to put forward notions implying recognition that Britain is 
some kind of geographical entity, though invariably a remote one. Consequently, 
symbols relating to Britain as a whole, such as the National Anthem or the 
Union Jack, were at best attributed to Scotland ; a majority of children in G.I 
failed to recognize them altogether. 

When asked whether they were British, a majority of the children in G.I 
and G.II replied negatively ; the same answer was given by nearly all children 
in G.I to the question as to whether they were Scottish ; but two-thirds of 
those in G.II affirmed that they were Scottish. In view of the evident confusion 
prevalent even among the latter, this result may appear surprising : why should 


* ‘Close’: strictly an entrance or passage, but extended to the common stairs of a 
tenement, or the building as a whole. 
they be ready to acknowledge being Scottish, when they clearly fall short of 
understanding the relationship between Glasgow and Scotland? In order to 
explain this, one has to abandon the exclusive concern with purely cognitive 
aspects. These children are growing up in a symbolic environment (i.e., that 
part of their environment consisting of words, signs, etc.) where Scotland and 
things Scottish are bound to figure rather prominently. Hence, even before 
they are capable of building up a coherent conceptual framework serving to 
categorize their geographical surroundings, they will become familiar with such 
words as “Scotland” and “Scottish,” as well as to a lesser extent “Britain” 
and “British.” Moreover, some broad and loose associations, normally with a 
favourable emotional tinge, will tend to be established between these words and 
certain items of their experience. This was often apparent in earlier pages: 
children were reminded by the word “Scotland” of their family, of Highland 
Dancers, and so forth. An accumulation of such associations leads to a readiness 
to acknowledge themselves as “Scottish”; but until such time as an adequate 
concept of “Scotland” and “Scottish” has crystallized, such quasi-identifications 
remain unstable and apt to be withdrawn when challenged. 

The purpose of labouring this point at some length is to show again how 
young children are sometimes able to use these words in a formally correct 
manner, when more careful probing would reveal that conceptual grasp still 
eludes them. In fact, during the first two or three years of their school career, 
most children have hardly begun to make sense of the wider geographical, 
social and political setting around them. It will not have escaped the readers' 
attention that it was necessary to qualify statements about the youngest 
children, because even they were far from uniform. Thus, one 6-year-old was 
already in G.IV, whilst at the other end of the scale, two 10-year-olds had not 
moved beyond G.II. This variability of individuals cannot be stressed too 
strongly, because it is in danger of being overlooked when one thinks in terms of 
age trends, and this applies most of all to the intermediate ages. A majority of 
the youngest children are likely to be in G.I or G.II, and most of the oldest in 
G.III or G.IV, but those aged 8-9 are distributed over the whole range. This 
holds true for children from M and W schools taken separately, as well as for 
the total sample. 

In general, the children from M schools were found to be significantly in 
advance of those from W schools; however, when M and W children were 
matched for intelligence test score, the difference disappeared. It was, therefore, 
concluded that intelligence accounts for a large part of both individual variations 
and group differences. 


VIII.—SOME EDUCATIONAL IMPLICATIONS. 
“He that hath names without ideas, wants meaning in his words, and speaks 
only empty sounds.” It would be entirely in the spirit of John Locke to 
add that the passage from names as more or less empty sounds to names backed 
by fully elaborated concepts is the outcome of a gradual learning process. 
However, it is not always easy to know whether or not a child who has a name 
speaks more than empty sounds. An adult may readily be led to assume that 
notions self-evident to him will also be within the reach of the child. One might 
again turn to Locke for an example: “The relation also that things have to 
one another in their places and distances is very obvious to observe ; as above, 
below, a mile distant from Charing Cross, in England, in London.” An analysis 
of the concepts underlying such statements, along the lines sketched in the 
present study, will indicate that to a child such relationships may be far from 
obvious. 
What bearing does this have on the teaching situation? The simple answer 
would be that the teacher should not jump too far ahead of the children’s 
capacity for adequate comprehension ; but this is hardly a very helpful recom- 
mendation. In practice, the teacher could not hope to ascertain the stage of 
conceptual development reached by each child, and even if this could be done, 
some compromise would be unavoidable. More effective action would be a recon- 
sideration of the manner of teaching such subjects as Geography and History. 
An examination of the Schemes of Work for teaching these in primary schools, 
keeping in mind the findings reported, will reveal the presence of numerous 
assumptions that are likely to be invalid; for instance, what kinds of ‘simple 
geographical terms’ are regarded as having to be taught, and which ones seem 
to be tacitly taken for granted ? 

Another possible suggestion is that some account be taken in planning the 
curriculum of what appears to be the dominant, if not universal, sequence of 
concept development from the immediate vicinity outwards. This would rule 
out, say, a leap from plans of the school room and adjacent streets to the globe 
or map of the world. The main aim would be a consolidation of the series 
widening out from the home town to Britain as a whole. 

Finally, it is important to guard against a possible misconception, arising 
from the general discussion of developmental stages. It has been stated that a 
child must attain a certain level of intellectual maturity before he is ready to 
move from one stage to the next; he must be capable of performing the 
appropriate logical operations as well as of handling spatial relationships. 
Thus, subject to the wide range of individual variation already emphasized, 
there is probably a ceiling to what a given child can understand at a particular 
time. However, it might be wrongly inferred from this that there is relatively 
little the teacher can do to help a child along. Such a view presupposes that 
most children more or less automatically advance to the limits of their poten- 
tialities, and this is, unfortunately, very far from the truth. Even in the course 
of the actual investigation, as an unintended consequence of merely trying to 
answer questions of a kind with which they were not normally faced, some 
children had a flash of insight ; things suddenly fell into place, and they reached 
a new level of understanding. If this can be brought about so casually and 
incidentally, it is likely that teachers could achieve far more by a systematic 
effort. It is hoped that the present investigation may make some limited 
contribution by pointing the way. 
THE SELF CONCEPTS OF TRAINEE-TEACHERS IN TWO 
SUB-CULTURES 


By A. S. PHILLIPS 
(Department of Education, University of the West Indies) 


Summary. (1) The aim of this study was to compare the Self Concepts of 
training college students from two different sub-cultures, viz., England and 
Jamaica. 

(2) Unstructured compositions by students from both countries, on the 
topic, “I myself: the person I am; the person others think I am; the 
person I would like to be,” were analysed to reveal the categories students use 
in describing themselves. Some interesting qualitative differences appeared, 
though students from both countries used the same categories in their self 
description. 

(3) Objective tests for measuring the Self Concept of students were constructed 
and administered to other students in both countries. The following 
comparisons between the two groups emerged: 

(a) Jamaican students, both male and female, of the regular age range as well 
as the mature, rated themselves significantly higher on the Cognized and 
the Ideal Selves, and were more Self Accepting than their English counterparts; 

(b) English male students rated themselves at approximately the same level 
as English female students. But, in the Jamaican samples, the female 
students tended to rate themselves higher than the men, and to be less 
Self Accepting. 

(c) There were no significant differences between the Self Concept scores of 
‘regular’ English and ‘mature’ English students. But the ‘mature’ 
Jamaican students cognized themselves significantly lower than the 
‘regular’ Jamaican students. 


I.—INTRODUCTION. 

WITHIN recent years the Self Concept of the individual has been receiving more 
and more attention from psychologists. It has come to be regarded as the vital 
part of personality structure, exercising a profound effect on behaviour and on 
happiness. According to Lecky* one of the functions of the Self is to produce an 
harmonious integrated personality. This is achieved largely by an acceptance 
of oneself as one is, not in a sense of complacency, but with a full knowledge 
of the things about oneself one cannot change, and of the endowments one has 
to work on. Such understanding leads to mental health. Besides, our general 
behaviour is, to a large extent, regulated and organized by what we perceive 
ourselves to be. 

If the individual, therefore, is to lead a reasonably happy, normal, well- 
adjusted life, free from fears and tensions, from feelings of constant self-reproach 
and utter inadequacies, it is fairly clear that he should acquire as early as possible, 
a Self picture that is adequate and realistic, and which will enable him to think 
of himself in a satisfactory manner. This obviously is a task for the agencies of 
education. Staines* has shown that satisfactory changes can be made in the 
Self Concepts of children, without any lowering of achievement in the ordinary 
school subjects, by merely adapting the methods of instruction. If the Self 
Concepts of children are of such vital importance, then the Self Concepts of 
trainee teachers must be of equal importance and worthy of further study. 
This research was, therefore, designed to probe the Self Concepts of teachers in 
training. This was done in two different sub-cultures, viz., Jamaica and 
England, and comparisons were made between the two groups. 
II.—THE JAMAICAN SOCIETY. 

The Self Concepts of students from the Jamaican and English sub-cultures 
might be expected to differ widely as the social forces in these two countries 
have been so varied. Jamaica has been an English colony for over 300 years. 
Many races compose the population, but the two dominant ones are the white 
and the negro. From the beginning of its colonial history, these two races were 
brought together with very clearly defined places in the social hierarchy. The 
white people owned all the property and all the wealth and were thus in a 
position of authority and prestige. The negroes were, in the beginning, merely 
slaves without rights—the chattel of their masters. Emancipation when it 
came, secured the physical freedom of the slaves, but the pattern continued. 
The coloured people were made to feel inferior in so many ways, some subtle, 
some more brutal, that it was virtually impossible for them to doubt their 
status. This was strengthened by the processes of education, when education 
did arrive for the coloured children. Its pattern was copied almost entirely from 
the United Kingdom. The syllabus of the schools was that designed for the 
United Kingdom. The standards and the ideals were theirs. Even the illustra- 
tions and the pictures presented to children were those prepared for children in 
England. Only in rare circumstances would a child see a negroid type depicted 
in an illustration, and even then it was usually conforming to the stereotype of 
‘backward native.’ Even the language they were taught was full of expressions 
equating black with bad or undesirable, and white with good or desirable. 
The image held up to the child then, particularly in relation to physical 
characteristics, would be that of the European, and the European type would 
certainly form the Ideal Self picture. 

This probably explains the coloured person's rejection of the negroid part 
of his ancestry (there has been much intermixture between the races in Jamaica) 
and his desire to escape from it, as typified by such practices as straightening 
the hair and using bleaching creams to lighten the complexion. 

Though the more obvious forms of discrimination no longer obtain in 
Jamaica and there is fairly general equality of opportunity for all races, never- 
theless, the attitudes still persist, and might be expected to produce Self 
Concepts full of unworthiness and inadequacy and marked by excessive rejection 
of Self. There should, therefore, be marked contrasts with the Self Concepts of 
training college students from England, where no such forces appear to operate. 


III.—SOME UNSTRUCTURED COMPOSITIONS. 

As a first step, groups of training college students in England and Jamaica 
were asked to write unstructured compositions on the topic: “I myself: the 
person I am; the person others think I am, and the person I would like to be.” 
These three sub-divisions of the main topic were taken as corresponding to the 
three levels of the Self, viz., the Cognized Self, the Other Self, and the Ideal 
Self. The aim was to discover the categories that students use in reporting on 
the Self Concept at these three levels, and also to provide qualitative evidence as 
to the differences in the way students from both countries cognize themselves. 

There was evidence in the essays that students were quite honest in giving 
their estimates of themselves, perhaps because they had been assured of 
anonymity. Remarks such as the following were quite frequent : 

“If this were to be read by any one in this college it would be quite 
impossible for me to write this.” 
or 
“I find it difficult for me to describe myself, but I shall try to be 
perfectly honest.” 
There was some evidence too, that the exercise tended to make the students 
somewhat uncomfortable, e.g. : 
“I am, at the moment, not liking putting down all these thoughts 
about myself, because it injures my pride to see myself in such a bad light” 
or 
“This hurts, but this is the truth.” 

An analysis of the essays showed that students in both sub-cultures used more 
or less the same categories in defining themselves. These categories, with the 
percentages from each group that gave replies in each category (for the level of 
the Cognized Self) are set out in Table 1.* 


TABLE 1 


PERCENTAGE OF STUDENTS WHO MENTIONED ITEMS IN VARIOUS CATEGORIES AT THE 
LEVEL OF THE COGNIZED SELF. 





Clothes, Dress and Grooming 
Material Possessions 
Home and Family 
Animals and Pets 
Intelligence 
Social Relationships 
Status and Role 
Wants, Goals and Aspirations 
Aesthetic Life 











Certain qualitative differences in the way in which students in each group 
described themselves also emerged. For instance, there was a tendency for the 
English students, especially the male students, to denigrate themselves. For 
example, one man wrote : 

    “I make a hell of a lot of noise; I talk too much, drink too much,
    lie too much, but I can't work up a conscience about any of these things.”
On the other hand, the Jamaican group tended to be largely uncritical of
themselves, except in minor facets of personality. They tended to express
themselves in superlatives, and to see themselves through very rosy
spectacles, e.g.:
    “(I am) a warm-hearted person (and) have some of the finest qualities
    of which human beings are capable.”
    “I am ... person of more than average ability. I am straightforward
    in all my dealings, thus causing my character to be beyond reproach.”

* For further details, see Phillips, A. S., “A Study of the Self Concepts of Selected
Groups of Training College Students in their relation to other variables in the Teacher
Training Situation.” Unpublished Ph.D. Thesis, London University.

Certain similarities were evident among the various groups, however. Table 1 
shows the high percentage in all groups that gave responses in the categories 
of ‘Personality and Character,’ ‘Performance,’ and ‘Social Relationships.’
This is in accordance with Jersild's* claim that mature people tend to see them- 
selves in terms of inner forces rather than of external qualities. The generally low 
frequency of responses in such categories as ‘Material Possessions,’ ‘Clothes,
Dress and Grooming,’ ‘Status and Role,’ etc., applied to all groups in both
sub-cultures.

Much greater emphasis was placed by both groups on the Cognized Self, 
rather than on the other two levels of the Self.* This may have resulted merely 
from the fact that the Cognized Self appeared first in the essays and so received 
far fuller treatment, though many students admitted that they had great 
difficulty in knowing what other people thought of them. Also, some, particular- 
ly among the Jamaicans, claimed that they would not wish to be any different 
from what they were. 


IV.—CONSTRUCTION OF AN OBJECTIVE TEST FOR MEASURING THE 
SELF CONCEPT. 

In order to secure more objective and quantifiable measures of the Self 
Concept, a test of the self rating type was constructed. This was divided into 
two parts measuring respectively the Cognized Self and the Ideal Self. It was 
decided to investigate only those categories at these two levels of the Self, as 
were actually reported on by the students in their essays. Twelve categories 
only were investigated, as one, viz., ‘Animals and Pets,’ did not appear to be of
sufficient universality to warrant inclusion. Later, as a result of statistical
analysis, another category, that of ‘Aesthetic Life,’ was also omitted, leaving
eleven categories in the final form of the test. 

The individual items in the test consisted of statements made by the 
students in their essays. The main criteria for selection of these statements 
were that they had to be applicable to people of different races, living in different
countries, to both sexes, and to people of varying ages. Also, items had to be
positively framed, that is, indicating possession of a desirable quality. Students
were asked to rate themselves on a five-point scale (from 0 to 4) as far as each item
was applicable to them. This gave a score for each item, and as items were 
grouped into categories, a score for each category. A summation of all items 
gave a total Self Concept score for the level of the Cognized Self. For the level of 
the Ideal Self, students were asked to rate themselves on these same items, not 
as they thought they were, but as they would like to be. The algebraic difference 
between the scores on the Cognized and the Ideal Selves (the discrepancy score) 
gave a measure of Self Acceptance. 
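
The scoring rules just described can be sketched in a few lines of Python. This is only an illustration: the function name, the item identifiers and the category assignments are hypothetical, and the sign convention (Ideal minus Cognized, so that a larger discrepancy means less Self Acceptance) is an assumption inferred from the way the discrepancy score is interpreted later in the paper.

```python
def score_self_concept(cognized, ideal, categories):
    """Score one student's self-rating test.

    cognized, ideal: dicts mapping item id -> rating on the 0-4 scale.
    categories: dict mapping category name -> list of item ids.
    All names here are illustrative, not taken from the original test.
    """
    for ratings in (cognized, ideal):
        for item, rating in ratings.items():
            if rating not in range(5):
                raise ValueError(f"rating for {item!r} must be an integer 0-4")

    # Category score: sum of the Cognized Self ratings of its items.
    category_scores = {
        name: sum(cognized[item] for item in items)
        for name, items in categories.items()
    }
    # Total scores: sum over every item at each level of the Self.
    cognized_total = sum(category_scores.values())
    ideal_total = sum(
        ideal[item] for items in categories.values() for item in items
    )
    return {
        "category_scores": category_scores,
        "cognized_total": cognized_total,
        "ideal_total": ideal_total,
        # Assumed convention: Ideal minus Cognized.
        "discrepancy": ideal_total - cognized_total,
    }


# Hypothetical three-item example.
example_categories = {"Intelligence": ["i1", "i2"], "Home and Family": ["h1"]}
example_cognized = {"i1": 3, "i2": 2, "h1": 4}
example_ideal = {"i1": 4, "i2": 4, "h1": 4}
example_result = score_self_concept(
    example_cognized, example_ideal, example_categories
)
```

In this made-up example the student's Cognized total is 9, the Ideal total is 12, and the discrepancy score is 3.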

An unequal number of items was selected for each category as some cate- 
gories were regarded as of wider conception than others. After the first trial 
run, it was apparent that certain categories hung together, covering more or 
less the same area of the Self Concept. It was, therefore, decided to group 
categories into areas. Evidence for such grouping was provided by correlating 
scores for various categories with each other. Next, the items forming an area
were tested for evidence of scalability according to the Guttman technique, and
this revealed that they did, in fact, form a scale. On this basis, five clear areas
of the Self Concept were revealed, viz.:
(1) Possessions and Desires.
(2) Personality, Character and Beliefs.
(3) Social Aspect.
(4) Performance Aspect.
(5) External Appearance.

All further comparisons were done in these areas.

* See Phillips, A. S., loc. cit.
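
As a rough illustration of what a test of Guttman scalability involves, the sketch below computes a coefficient of reproducibility for dichotomous responses. It is not the author's procedure: the paper does not say how the five-point ratings were treated, so the reduction of each rating to 0 or 1 is an assumption, as are the function name and the simple error-counting rule (each observed pattern is compared with the ideal pattern for the same total score).

```python
def coefficient_of_reproducibility(responses):
    """Guttman coefficient of reproducibility for 0/1 item responses.

    responses: list of equal-length lists, one per respondent.
    Items are ordered from most to least frequently endorsed; a respondent
    with total score s is expected to endorse exactly the s most popular
    items. Every departure from that ideal pattern counts as one error.
    """
    n_items = len(responses[0])
    totals = [sum(row[j] for row in responses) for j in range(n_items)]
    order = sorted(range(n_items), key=lambda j: -totals[j])

    errors = 0
    for row in responses:
        score = sum(row)
        ideal = [1] * score + [0] * (n_items - score)
        observed = [row[j] for j in order]
        errors += sum(o != i for o, i in zip(observed, ideal))
    return 1 - errors / (len(responses) * n_items)


# Hypothetical data: a perfect scale, and the same scale plus one deviant row.
perfect = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
with_deviant = perfect + [[0, 0, 1]]
cr_perfect = coefficient_of_reproducibility(perfect)
cr_deviant = coefficient_of_reproducibility(with_deviant)
```

A coefficient close to 1 indicates that the items order respondents almost perfectly along a single dimension, which is the sense in which the items of an area "form a scale."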

The reliability of the test was determined by a re-test after an interval of
three weeks, when a product-moment correlation coefficient was calculated.
This gave an overall figure of +.87 for the Cognized Self and +.80 for the
Ideal Self. This was regarded as adequate.*
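
For readers unfamiliar with the calculation, a test-retest reliability of this kind is simply the product-moment correlation between each student's first and second total score. The sketch below shows the arithmetic; the function name and the sample scores are hypothetical and bear no relation to the study's data.

```python
import math


def pearson_r(first, second):
    """Product-moment correlation between two equal-length score lists."""
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two lists of the same length (at least 2)")
    mean_x = sum(first) / len(first)
    mean_y = sum(second) / len(second)
    # Sum of cross-products and sums of squares about the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(first, second))
    sxx = sum((x - mean_x) ** 2 for x in first)
    syy = sum((y - mean_y) ** 2 for y in second)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical total scores for three students on test and re-test.
test_scores = [1, 2, 3]
retest_scores = [1, 3, 2]
r_example = pearson_r(test_scores, retest_scores)
```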


V.—COMPARISON OF STUDENTS FROM ENGLISH AND JAMAICAN COLLEGES. 
Table 2 gives the numbers and average age of students in each sample. 

‘Regular’ students were taken as those between 18 and 24 years; ‘Mature’
students were taken to mean those over 25 years. Unfortunately, it was not
possible to obtain large enough samples of mature male students in either
country.


TABLE 2
NUMBERS AND AVERAGE AGE OF SAMPLES.

Sample                        Average Age
Regular English Female        19 years 4 months
Regular Jamaican Female       20 years 4 months
Regular English Male          20 years 8 months
Regular Jamaican Male         20 years 11 months
Mature English Female         36 years 4 months
Mature Jamaican Female        30 years 5 months



In comparing scores for training college students from both sub-cultures, 
only total self concept scores are given here.† These are set out in Table 3.


TABLE 3

MEAN SELF CONCEPT SCORES.

COGNIZED SELF
Regular Female 19 +3012 -90

Regular Female
Regular Male
Mature Female

SELF ACCEPTANCE
Regular Female
Regular Male
Mature Female

* For complete test, see Phillips, A. S., loc. cit. 
† For full comparison, see Phillips, A. S., loc. cit.
It had been expected that the Jamaican students would cognize themselves
less adequately (that is, rate themselves lower) than their English counterparts,
and be less self accepting (i.e., have a higher discrepancy score). It was revealed that the
Jamaican students, both male and female, those of the regular age range, as 
well as the mature female students, rated themselves significantly higher on the 
Cognized as well as on the Ideal Self. They also had a lower discrepancy score 
(i.e., they were more self-accepting) though this difference does not reach 
significance for the mature female group. These trends are generally true for all 
areas of the selves. 


It is not easy to explain the higher rating achieved by the Jamaican student
over his English counterpart. Perhaps it has its roots in the awakening national 
and racial consciousness evident in Jamaica over the last twenty years. During 
this period has come a more advanced political constitution for the island, and a 
widening of opportunities for its dark skinned inhabitants. An unofficial colour 
bar, which operated prior to 1940, has disappeared. Generally speaking, the 
negro has acquired a new pride in himself. But this does not explain why the 
Jamaican students, even the much older ones, should cognize themselves so 
much higher than (instead of more nearly equal to) the English students. 
Another possible explanation is that defence mechanisms are at work. Many 
forces have been trying to impress on the Jamaican his unworthiness and
inadequacy. But he cannot live with a self picture as unlovely as this. So he 
seeks reassurance by conceiving of himself as much better than he actually is. 
Hilgard! commenting on the operation of the mechanism in the Self Concept, 
points out that the need for self deception arises because of a more fundamental 
need to maintain or restore self esteem. 


Perhaps, however, the explanation lies simply in the differing experiences 
of the respective groups. The Jamaican samples were drawn from a level of the 
population which, by and large, did not have many opportunities for getting 
ahead and achieving success. But these particular ones had achieved greater 
success than their peers. Thus, they saw themselves as more intelligent, of better 
personality, able to achieve more, etc. On the other hand, their English
counterparts were less successful than their peers from grammar schools who had gone
on to university or entered other more highly esteemed professions. It should be 
noted, too, that though the Jamaican samples had a higher Ideal Self, they were, 
in fact, more Self Acceptant. 


TABLE 4

SEX DIFFERENCES IN THE SELF CONCEPT.

                          Mean     S.D.     Mean     S.D.
COGNIZED SELF
Jamaican—Regular ....   172.58    10.96   176.50    12.80
English—Regular ....    143.26    18.70   137.30    19.30

IDEAL SELF
Jamaican—Regular ....   181.75    14.62   195.26
English—Regular ....    172.63    17.80   172.87

SELF ACCEPTANCE
Jamaican—Regular ....    10.18    10.10    17.36
English—Regular ....     29.92    14.50    35.02
It had been expected that there would be no difference in the scores of men
and women on the Self Concept scales at either level of the Self or in Self
Acceptance in either sub-culture. It was found that this was true for the
English samples, but not for the Jamaican samples. For the Jamaican samples, 
though the difference in the total scores of men and women did not reach 
significance, women cognized themselves significantly higher than men on three 
of the five areas making up this total score. On the Ideal Self women were 
significantly higher than men, and they were less Self Acceptant. 

It is possible that the explanation of the difference in Self Concept of 
Jamaican men and women lies in the fact that the Jamaican society is somewhat
matriarchal in structure, though in certain respects it is patriarchal, with a
Victorian flavour.* Women very often have to assume the role of both mother
and father in the family. They are very often the mainstay and general provider.
It is not surprising, therefore, that they should conceive of themselves in higher
terms than their male counterparts. But perhaps they are not satisfied with
playing this role. This might account for their being less Self Acceptant than 
men. No similar pressures appear to be operating in the English population. 


TABLE 5
AGE AND THE SELF CONCEPT.

COGNIZED SELF
Jamaican Female
English Female

IDEAL SELF
Jamaican Female
English Female

SELF ACCEPTANCE
Jamaican Female
English Female

It was believed that by the time people have reached adult status, the Self 
Concept would have become relatively stable and would consequently change 
very little in succeeding years. It was, therefore, hypothesised that there 
would be no significant differences in the Self Concepts and Self Acceptance of 
‘regular’ and ‘mature’ training college students in either culture. It was 
found that this was true of the English samples but not of the Jamaican 
samples. There are some interesting differences in the direction of change in 
the different areas, however.†

It seems reasonable to conclude that by early adulthood the personality 
structure of students in the English sub-culture has achieved greater stability 
than the personality structure of students in the Jamaican sub-culture. Con- 
sequently, while the Self Concepts of English students change little with 
succeeding years, Jamaican students are still changing and modifying their 
concepts of themselves. Perhaps they are less mature emotionally than their 
English counterparts. 

* See Kerr, M.? 

† For full exposition, see Phillips, A. S., loc. cit.
VALUE OF THE ORAL RESPONSE IN BEGINNING
READING: AN EXPERIMENTAL STUDY USING
PROGRAMMED INSTRUCTION*


By JOHN D. McNEIL AND EVAN R. KEISLAR
(University of California, Los Angeles.) 


Summary. The question of the effect of vocalization in reading has not received
a clear answer. In the present study, the procedures of programed instruction
were used in testing the hypothesis that the saying of words and sentences out
loud by beginners in reading will result in greater ability to recognize and
understand written words and sentences. An auto-instructional program which
taught forty words was administered to 182 non-readers in the kindergarten
during a seventeen-day period. These children had been paired and assigned to
oral and non-oral groups where they received identical programed instruction
with the exception that those in the oral group were directed to ‘say the words’
instead of to ‘look at the words.’ An examination which called for silent reading,
not oral reading, was given at the end of the last lesson. The results supported
the hypothesis that the oral response is valuable in learning to read.


I.—INTRODUCTION. 


How important is it for beginners to say words out loud in learning to recognize 
and understand written words and sentences? In order to collect evidence 
regarding this perennial and fundamental question, procedures of programed 
instruction were used since they offer a high degree of reproducibility and 
control. 


The psychological and educational literature has not given consistent 
answers to the value of oral responding in learning to read. Recognition and 
recall of objects have been found to be greater if the names of the objects are 
vocalized by the learner ê 3, 9. Jeffrey reported research which suggested that 
self-stimulation, the responding to one's own responses, helps in the
overcoming of difficulty in learning to apply distinctive names.5 One's own saying
of a word might serve as a stimulus as well as a response. It is a stimulus in that
the individual can hear and feel himself talk. “The feel of saying the word is
part and parcel of the same pattern of speech stimulation.”*


On the other hand, the observation that many poor readers verbalize has 
led some to the opinion that an oral response in learning to read is detrimental 
to reading achievement. Acting on this assumption, educators like McDade’ 
and Buswell! sought favorable results from a method of beginning reading which 
required no oral responding and in which children were told not even to say the
words to themselves. Learners were only to match words with objects, pictures,
and actions. 


The specific hypothesis of the present study was that kindergarten children 
taught by programed materials which required the saying of words and 
sentences out loud would achieve greater skills in reading than children taught 
by the same programed materials with no instruction to speak out loud. 


* This research project was supported by a grant from the United States Office of 
Education, Department of Health, Education, and Welfare (Grant No. 1413) 
II.—DESIGN OF THE EXPERIMENT.


The general plan of the study called for presenting an auto-instructional 
program, consisting of seventeen daily lessons, to a number of kindergarten 
children in small groups, half of whom were required to make oral responses and 
half of whom did not. At the end of the last lesson, the extent of learning was 
measured by a post-test. The presentation of the program to the small groups 
constituted replications of the experiment, providing evidence of reliability of 
the treatment and its effects. 


Subjects. 

The children used in the study had an average chronological age of 5 years
and were the total kindergarten populations at three selected public elementary
schools. One school represented an upper-middle class native-born white
community; one school was located in a lower-middle class community,
predominantly of Japanese and Mexican backgrounds; and a third was located
in a middle class community typical of those in most Southern California
cities.

Before commencing the experiment, each child was given an intelligence 
test as well as a reading readiness test which measured his knowledge of letters, 
words to be taught in the program, and ability to recognize likeness in letter 
configurations. Children from the same classroom were paired on the basis of 
scores obtained from these tests and also upon their record of absenteeism. One 
member of each pair, randomly selected, was assigned to an oral group, the other 
member to a non-oral group. From these pairs thirteen oral and thirteen
non-oral groups consisting of ten or fewer children each were constructed.
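
The assignment step described above (one randomly chosen member of each matched pair to the oral condition, the other to the non-oral condition) can be sketched as follows. The function name, the seed parameter and the child identifiers are hypothetical; the paper does not describe the randomizing device actually used.

```python
import random


def assign_matched_pairs(pairs, seed=None):
    """Randomly split each matched pair between the two conditions.

    pairs: list of (child_a, child_b) tuples already matched on test scores
    and absenteeism. Returns (oral, non_oral) lists in the same pair order.
    """
    rng = random.Random(seed)
    oral, non_oral = [], []
    for child_a, child_b in pairs:
        # A fair coin decides which member of the pair responds orally.
        if rng.random() < 0.5:
            child_a, child_b = child_b, child_a
        oral.append(child_a)
        non_oral.append(child_b)
    return oral, non_oral


# Hypothetical identifiers for three matched pairs.
example_pairs = [("c1", "c2"), ("c3", "c4"), ("c5", "c6")]
oral_group, non_oral_group = assign_matched_pairs(example_pairs, seed=0)
```

Because the split is made within pairs, the two conditions stay balanced on every matching variable whatever the outcome of the random draws.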


There were 231 children in the original sample. The subjects who were 
dropped from the study were: forty-three children absent more than three 
consecutive days, four children unable to speak English and two children 
identified by their teachers as hyperactive (emotionally disturbed). The 
remaining 182 children, therefore, made up the subjects for the study. These 
children had an average intelligence quotient of 107 with a range from 72-138 
as measured by the Kuhlmann-Anderson Test of Intelligence. 


III.—METHOD.
Programed Materials.

The instructional program consisted of seventeen daily lessons which took
slightly over three weeks to administer. Each lesson was made up of approximately
thirty-five questions or frames. Two sets of the programed materials,
identical in all respects but one, were prepared. The non-oral set carried a
taped commentary which directed the learner to “look at the word(s)”; the
other set directed the learner to “say the word(s).” In order to assure that
children in both groups were attending to the reading materials, all subjects 
were required to respond to problems throughout the program ; for each prob- 
lem, children were to select one of three choices offered by pressing an appro- 
priate button. The oral group, therefore, made both oral and button-pressing 
responses; the non-oral group made only the button-pressing response. 


Forty words were taught during the program both as independent words 
and as sentences. These words represented terms commonly found in the oral 
vocabulary of young children? and included both function words (i.e., and, with) 
and those with a direct referent (i.e., mother, girl). The printed words taught 
in this study were among those used in everyday speech by children. The 
study was not concerned with extending the child's vocabulary. 
Examples of the 700 frames in the program are shown below:

(1) Purpose. To introduce a new word.
    Frame. Children
           children   woman   dog
    Commentary. “Children. There are children in your room at school. The
    word at the top says children. (Say the word.) (Look at the word.) Which
    word below also says children?”

(2) Purpose. To facilitate reading from left to right and to emphasize
    relations of a printed word to a pattern of language structure.
    Frame. children run ......
           fast   girl   boy
    Commentary. “(Say) (Look at) the words at the top. Which word below
    sounds right with the top words?”

(3) Purpose. To maintain motivation as well as to test for comprehension.
    Frame. the boy and girl can run
           picture A   picture B   picture C
    Commentary. “(Say) (Look at) the words at the top and find the picture
    that goes with the words.”

(4) Purpose. To help the learner identify sentences which are appropriate
    answers to the questions he hears.
    Frame. Billy      Billy       we go
           can eat    runs fast
    Commentary. “What does Billy do when he is in a hurry?”

(5) Purpose. To help learners rely on more than one cue in identifying
    words (e.g., length instead of initial letter or configuration alone).
    Frame. children   cold   chair
    Commentary. “Which word says children?”

(6) Purpose. To give practice in recognizing and comprehending function
    words.
    Frame. [not legible in source]
    Commentary. “Find the missing word.”


The program aimed to develop four generalizable skills. Beginning with
the first lesson, efforts were made to teach children (1) to respond appropriately
to new combinations of printed words, arriving at a complete thought or
meaning, (2) to identify the particular word in a sentence which provides an
answer to a posed question, (3) to respond to a variety of cues in discriminating
among words, and (4) to read from left to right. To accomplish these four
objectives, the following principles of program preparation were adopted:

(a) In order to get the learner to read from left to right, new words were
    introduced on the right of previously learned words (e.g., Frame (2) above).

(b) Pictures were rarely used in the program. The responses to printed words
    were prompted orally by the taped commentary. Occasionally, for
    motivational purposes, children would be asked to look at or say the words at
    the top and then find the picture from among three that “went with the
    word” (e.g., Frame (3)).

(c) The sequence of frames was built around reiteration of the words and the
    generalizable skills of word and sentence recognition. There was no
    narrative or thematic structure. Words and sentences were reviewed at
    unexpected times.
The overall difficulty level of the program (percentage of errors on first
trial) was 33 per cent. for the oral group and 38 per cent. for the non-oral group.
Performance by individuals reflected a range in difficulty from 70 per cent. 
errors to 2 per cent. The average difficulty of the daily lessons ranged from 
23 per cent. to 43 per cent. errors (oral group) and 23 per cent. to 51 per cent. 
(non-oral group). 


The Apparatus. 

The apparatus provided for instruction of ten or fewer children at a time. 
Each child sat in an individual laboratory-type cubicle equipped with head- 
phones through which he heard the taped commentary. The cubicle also contained 
a microphone as well as response and confirmation panels. The response panel 
consisted of three buttons by which the child indicated his selection in answering 
the multiple-choice problems in reading. Confirmation of the correct answer 
was by lights; a green light was shown when the learner made the correct 
response, and a red one when he was in error. 

A single 6-ft. x 8-ft. screen at the front of the room was visible from each 
cubicle. On this screen, learners saw each daily sequence presented by film- 
strip, while simultaneously hearing the accompanying taped commentary. The 
program advanced automatically at a paced schedule with 3 seconds allowed
for responding to each frame. As each individual in the oral group made vocal 
responses into his microphone, he heard his own voice. Frequent monitoring 
of the individual oral responses through the microphone circuit gave evidence 
that subjects were responding orally as directed. When the non-oral group 
used the equipment, the microphones were deactivated. A record of each 
individual’s multiple-choice performance for each frame was obtained by means 
of a punched paper tape. 

The apparatus, therefore, combined advantages of both group teaching
(i.e., economy, control of word exposure time and pacing) and tutorial
instruction (i.e., personal communication without peer group distraction and
provision for individual response and confirmation).


The Testing Procedure. 

The criterion test provided evidence that the learner had acquired 
knowledge of the words and reading skills previously mentioned. This post- 
test consisted of fifty-one multiple-choice items, each with three or four altern- 
atives, similar in format to the frames in the program except that the items 
appeared in individual booklets, one item per page. Instructions for each
item were given orally. No item in the post-test was identical to any frame in 
the program ; the context and the arrangement of alternatives had not been 
seen previously by the pupil. Twenty-four items required that the learner 
recognize a given word. The remaining items required the child to select the 
individual word or sentence which supplied the answer to an oral question, or 
an incomplete sentence, or which best described a picture. 


Examples of the test items are as follows : 


(1) Objective. He selects an individual word when it appears with those of
    (a) like and (b) unlike shape.
    Instruction. “Find the word that says dog.”
    Test Item. (a) boy, dog, toy.
               (b) children, tree, dog.

(2) Objective. He selects the word that will complete a sentence never seen
    before.
    Instruction. “Find the missing word.”
    Test Item. a dog can play ...... a ball
               red, with, no, little

The test was administered in group situations, twelve children at a time,
oral and non-oral together. 


IV.—RESULTS AND CONCLUSIONS. 


(1) Overall Effect of Oral Response. 

The mean score for the oral group on the criterion test was 30.3 (SD=10.0)
while the non-oral mean was 23.8 (SD=7.9). This superiority on the part of
the oral responders was significant when based on the average difference
between matched pairs (t=5.64, p<.01). Since the mean score for forty-seven
uninstructed kindergarten children was only 12.7 (SD=3.3), both the oral and
non-oral forms of the program appear to be effective in teaching.
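
A test "based on the average difference between matched pairs" is the familiar paired t-test. The sketch below shows the computation on hypothetical scores; the individual pair data are not reported in the paper, so the numbers here are invented solely to illustrate the arithmetic, and the function name is an assumption.

```python
import math
import statistics


def paired_t(oral_scores, non_oral_scores):
    """Return (t, degrees of freedom) for matched-pair scores.

    t = mean difference / (SD of differences / sqrt(number of pairs)).
    """
    if len(oral_scores) != len(non_oral_scores) or len(oral_scores) < 2:
        raise ValueError("need at least two complete pairs")
    diffs = [a - b for a, b in zip(oral_scores, non_oral_scores)]
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample SD, n - 1 in the denominator
    t = mean_diff / (sd_diff / math.sqrt(len(diffs)))
    return t, len(diffs) - 1


# Hypothetical post-test scores for five matched pairs.
example_oral = [30, 35, 28, 40, 33]
example_non_oral = [25, 30, 27, 31, 29]
t_value, df = paired_t(example_oral, example_non_oral)
```

Working on the within-pair differences removes the variation between pairs (ability, readiness, attendance) from the error term, which is the reason for matching in the first place.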


The superiority of the oral response group is illustrated in Figure 1 where 
the results are presented for the three groups in terms of percentage of total 
possible scores. Since there was no correction for guessing, the dotted line 
indicates a chance score which would have been obtained by blind guessing. 
Using this line as a base, it may be noted that the effect of the oral response is 
considerable. 


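FIGURE 1

COMPARATIVE CONTRIBUTION OF ORAL AND NON-ORAL RESPONSE MODES IN READING
ACHIEVEMENT.

[Chart: percentage of total possible score for the Oral, Non-Oral and
Uninstructed Control groups, with a dotted line marking the chance score.]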


Since the experiment was carried out with thirteen pairs of matched groups, 
we may also view the data non-parametrically. In each of the thirteen pairs of 
groups, the mean of the oral group was higher than the mean of the
corresponding non-oral group. On the basis of a sign test, this difference is
statistically significant (p<.01).
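
The sign test result can be checked directly from the binomial distribution: if the oral condition made no difference, each of the thirteen pairs of groups would be equally likely to favour either condition. The sketch below, with a hypothetical function name, computes the two-sided probability of an outcome at least as lopsided as the one observed.

```python
from math import comb


def sign_test_p(n_positive, n_pairs):
    """Two-sided sign-test probability under a fair-coin null hypothesis."""
    # Size of the more extreme tail: k or more differences in one direction.
    k = max(n_positive, n_pairs - n_positive)
    tail = sum(comb(n_pairs, i) for i in range(k, n_pairs + 1)) / 2 ** n_pairs
    return min(1.0, 2 * tail)


# Thirteen of thirteen pairs of groups favoured the oral condition.
p_thirteen = sign_test_p(13, 13)
```

With all thirteen differences in the same direction the two-sided probability is 2 × (1/2)^13, or about .0002, comfortably below the .01 level quoted.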


(2) Value of Oral Responding at Different Levels of Ability. 
Although it was speculated that the oral response would be especially 
important for children of lower intelligence, the interaction effects were not 
large enough to be statistically significant. However, the direction of the
results suggests the desirability of further inquiry regarding the special
advantages of oral responding by those of lower intelligence. Table 1 below shows the
mean scores by three levels of ability for both oral and non-oral responders.

TABLE 1

POST-TEST RESULTS FOR ORAL AND NON-ORAL SUBJECTS.

                          Oral                  Non-Oral
                          Written Post-test     Written Post-test
Low I.Q. (N=60)           M=25.2
Middle I.Q. (N=62)        M=31.8
High I.Q. (N=60)          M=33.7

Total N=182               M=30.3





(3) Value of Oral Response in Motivating Children to Want to Read. 

In order to throw light upon the attitudinal effect of the program and the 
oral response, an interview was held with each child. Children from both oral 
and non-oral groups were individually asked whether they would prefer to 
continue with the program when given each of the following as an alternative : 
(1) listen to records ; (2) listen to a story ; (3) paint ; or (4) play in the sandbox. 
Over 72 per cent. of the children in both groups preferred the reading activity 
to one or more of these other activities ; 10 per cent. chose the reading activity 
in preference to all four of the other alternatives. Parenthetically, we can say 
that the introduction of a formal reading program was received with as much 
enthusiasm as currently popular activities in the kindergarten. 


Although achievement in learning was not found to be associated with 
preference for continuing the reading activity, the data suggested that oral 
responding might have motivational properties for those of lower intellectual 
ability (I.Q. below 100). Preferential scores for oral and non-oral groups in
middle and high I.Q. levels averaged 1.5, but preferential scores for the low I.Q.
level were 1.6 and .97 for the oral and non-oral groups, respectively.


(4) Conclusion. 

The results show that oral responding by beginning readers facilitates the 
recognition and comprehension of printed words and sentences. Children who 
responded orally during the auto-instructional program were superior on a test 
which called for the application of a variety of reading skills in contexts different 
from those taught in the program itself. The oral response was of significant 
value even though the post-test called for silent reading, not oral responding, 
and none of the items of the post-test appeared in the training program. 


As indicated earlier, one way to explain the mechanism involved in the 
greater achievement with overt vocalization is that the distinct stimulus of 
saying words aloud is required for mediation of the obtained effects. 

The experiment gives no support for the belief that oral responding in 
learning to read has unfavorable effects. On the contrary, we must conclude 
that, although vocalization and poor reading on the part of the learner are often 
associated, the act of vocalization during instruction is an aid to the beginning 
reader in what for him is already a difficult reading situation. Without such 
vocalization, his performance would be worse. 
THE INFLUENCE OF EARLY PUBERTY ON TEST
PERFORMANCE AT AGE ELEVEN

SUMMARY. It has been suggested that the earlier onset of puberty among girls
may now be affecting performance in tests at the age of 11 plus. This question
was examined by the analysis of test scores at ages 7, 9, 11 and 13 from a
year-group of girls numbering 1,385, for whom age at first menstruation and
father's occupation were known. The very early maturers proved to have a
slightly higher average test score than the others, but they were superior at all
ages and not only at 11. While the results at age 13 do not rule out the
possibility of a later pubertal spurt, it can be concluded that early onset of
puberty does not confer any advantage in selection tests at age 11.


I.— INTRODUCTION. 


THE evidence for the secular trend towards early maturity has been summarized 
recently by Tanner (1961): for example, he reports that among girls in Norway, 
in 1840, the average age at menarche (or first menstruation) was 17} years, 
whereas to-day it is about 13 years. The trend appears to have been maintained 
steadily over the whole of the last hundred years, and it is still continuing. 
(In earlier times, menarche seems to have occurred around the age of 14. 
Shakespeare’s Juliet was 13: he devotes ten lines of Act I, Scene III, to 
emphasize that she is just “ a fortnight and odd days ” short of her fourteenth 
birthday.) The factors influencing the age of puberty remain uncertain; 
inadequate records and differences in mortality rates make research difficult. 


This trend has aroused concern among those who are responsible for 
operating procedures of selection for secondary education. Girls reach puberty 
earlier than boys; and if there is a spurt in mental growth similar to the 
acceleration in physical growth at puberty, then as puberty occurs earlier, the 
spurt will be beginning to affect the performance of early maturing girls in 
tests at the age of 11 plus. Since the advantage of an early growth spurt is 
only temporary, this would reduce the validity of the selection procedure. 
Tanner (1962) has recently reported that early maturing girls are, indeed, 
scoring higher on average in selection tests than the late maturers. However, 
this finding does not allow for the possibility that the early maturers may have 
a consistent superiority at all ages over the late maturers. 


Other indirect evidence would appear to be consistent with the hypothesis 
that scores at age 11 are beginning to be affected by the earlier onset of puberty, 
though here too other explanations of the findings are possible. The Scottish 
Mental Survey (1949), in which a test given to all the 11-year-old children in 
Scotland in 1932, was given again to the 11-year-old age-group in 1947, showed 
that the average score in 1947 was higher than in 1932. In 1932, the boys were 
slightly better on average than the girls. By 1947, the girls had not only caught 
up, but had passed the boys, and scored on average two points higher than the 
boys. Emmett (1950) in a similar comparison of age-groups in ten English 
areas over the period 1936-1947, also found that the average score of girls had 
increased relative to that of boys, though there was some indication that the 
girls’ superiority had begun to fall away by the age of 13. Boyne and Clark 
(1959) found a similar increased superiority of girls over boys among Aberdeen 
school children over the period 1947-1957. 


These findings would appear to be in accord with the hypothesis that a 
spurt in mental growth accompanying puberty is affecting performance at 
age 11. However, the data can be explained without assuming an adolescent 
spurt in mental growth. Differences between mean test scores of boys and girls 
may arise from a variety of causes. For example, girls are generally better 
than boys in verbal problems and boys are generally better in non-verbal and 
numerical problems. There is a tendency in many tests to use non-verbal 
material for the simpler items and verbal material for the more difficult items. 
Thus, if there is a slow general improvement in over-all performance (whether 
due to coaching or any other cause), this would also result in a trend towards 
girls’ superiority as the whole age-group moved up into a type of item which 
tended to favour the girls. 


For an adequate test of the hypothesis, a longitudinal study is necessary. 
An age-group must be tested at intervals from age 7 (or from any age at which 
it can be safely assumed that puberty has not yet affected test performance), 
up to at least age 13 (by which age it can be assumed that the influence of 
puberty, if it exists, will have begun to show an effect). Comparison of the 
average scores of the early maturers and the remainder provides a test of the 
hypothesis that there is a spurt in mental growth accompanying puberty, and 
will show whether earlier onset of puberty is affecting performance in tests at 
age 11. 


II.—SUBJECTS, TESTS AND OTHER DATA. 


The population studied was defined as those girls who were in the second 
year of secondary schools in Aberdeen—that is, around the age of 13 or over— 
and who were present on a day in December, 1960, on which one of our tests 
was given. For this year-group, the education authority records were searched 
to identify the earlier scores of these girls in the tests which had been given to 
the whole age-group at ages 7, 9 and 11, as part of the regular testing programme 
in the City of Aberdeen. On these earlier occasions, absentees were followed up 
to ensure that a test score was available for every pupil at each age; for lack 
of resources it was not practicable to follow up the absentees at age 13. Although 
all secondary schools, both local authority and independent schools, were 
included in the testing at age 13, earlier test scores were not available for most 
of the girls from two private schools attended by some 6 per cent. of the age- 
group of girls, as only a few of the girls at these two schools had taken the 
earlier tests. Migrants into the city also missed the earlier tests. In the end, 
complete test records from age 7 were traced for 85 per cent. of those tested at 
age 13. The sample is thus slightly biased towards the lower occupational 
groups. 

The tests used were: at age 7, Moray House Picture Test; at age 9, 
Schonell's Essential Intelligence Test ; at age 11, Moray House Verbal Reason- 
ing Tests 57 and 58 ; and at age 13, Moray House Advanced Verbal Reasoning 
Test 10. Not all the sample sat the earlier tests at the same time: thus, at age 
11, the large majority sat Moray House Tests 57 and 58, but a few sat the tests 
applied the previous year, Tests 55 and 56. This difference had to be ignored : 
age-allowances provide for differences in age at testing, and the Moray House 
Tests are intended to be, and generally are, comparable with one another. 
All the test scores were re-standardized on this sample at each age, to equate the 
means and standard deviations and to produce normal distributions in each 
test, thus eliminating differences in standardization between tests. Means of 
100 and standard deviations of 15 were imposed in a conversion based on scores 
grouped in intervals of five points: interpolations within intervals resulted in 
means which worked out slightly below 100 and standard deviations below 15. 
The means and standard deviations of the original scores and of the scores 
after conversion are set out in Table 1. 
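
The re-standardization described above can be illustrated by a minimal sketch. It is not the authors' procedure: they converted scores grouped in five-point intervals, whereas the sketch assumes a mid-rank percentile conversion applied to ungrouped scores. The function name `restandardize` and the raw scores are hypothetical.

```python
from statistics import NormalDist, mean, pstdev


def restandardize(raw_scores, target_mean=100.0, target_sd=15.0):
    """Convert raw scores to a normalized scale (assumed mid-rank method)."""
    n = len(raw_scores)
    # Mid-rank percentile of each distinct raw score; tied scores share one value.
    percentile = {}
    for value in set(raw_scores):
        below = sum(1 for s in raw_scores if s < value)
        equal = sum(1 for s in raw_scores if s == value)
        percentile[value] = (below + 0.5 * equal) / n
    unit_normal = NormalDist()
    # Map each percentile to a normal deviate, then rescale to the target scale.
    return [target_mean + target_sd * unit_normal.inv_cdf(percentile[s])
            for s in raw_scores]


# Hypothetical raw scores, for illustration only.
raw = [88, 95, 95, 101, 110, 124, 131]
converted = restandardize(raw)
```

With a small sample the converted mean and standard deviation only approximate the imposed values of 100 and 15, so they are best checked after conversion, as was done for Table 1.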


TABLE 1 

MEANS AND STANDARD DEVIATIONS BEFORE AND AFTER RE-STANDARDIZATION. 

Original Scores, Mean, by Test at Age—: 108.85, 109.70, 104.95, 106.41, 107.90 

The next step was to find out age at menarche for each girl. The investiga- 
tion was restricted to girls so that the age at first menstruation could be used as 
an indication of whether puberty came early or late. Since the sample was aged 
about 13 or over, menarche was a fairly recent event for many of them and it 
was, therefore, not necessary to rely on distant and possibly faulty memory. 
A considerable proportion had not yet reached menarche, but it was still 
possible to separate the early developers and the very early developers from the 
remainder. Existing medical records did not include age at menarche ; but the 
School Health Service in the city undertook to interview individually all the 
girls in the sample to ask for this information. In three schools, the head pre- 
ferred that the information should be got by writing to the mothers of the girls. 
This was done and practically all replied. Difficulties at this stage of the inquiry 
had been anticipated and particular attention was given to the method of 
approach. No one was scandalized, no protests were received, and information 
on age at menarche was obtained for 98.8 per cent. of those tested at age 13. 


Information on social class was based on the answers to carefully designed 
questions on father’s occupation. On this basis, pupils were graded into six 
groups on a modified version of the Registrar-General’s Scale: I and II, 
professional and business; IIIa non-manual skilled workers; III skilled 
manual workers; IV semi-skilled manual workers; and V unskilled workers. 
When a pupil's description of the father's occupation was too vague to be 
classified, the pupil was asked, through the headmaster, for further information. 
The answer “Don't know (father's occupation)” was permitted, to allow for 
illegitimate children and other special cases. Only 2.3 per cent. of the total 
group could not be given a grading in one of the six categories and it was possible 
to establish from the detailed records of the Aberdeen Maternity Hospital 
that a considerable proportion of these were, in fact, illegitimate children. 

III.—ANALYSIS OF RESULTS. 


Complete test records—that is, scores on tests at ages 7, 9 and 13, and on 
two tests at age 11—were available for 1,402 girls. Information on menarche 
was lacking for seventeen of these, leaving a total of 1,385, from whom the 
following results are derived. The age range at the time of interview included 
six pupils who were under 13 years, and six over 14 years 6 months, but 91 per 
cent. of ages at interview were between 13 years 3 months and 14 years 2 months. 
The median age at menarche in the sample was 13 years 2 months. Menstrua- 
tion began before the age of 11 in twenty-four cases, under 2 per cent. ; before 
age 12 in 165, or 12 per cent. ; and before age 13 in 40 per cent. of the group. 
Age at menarche was classified in three categories : 


(i) The very early maturers, with menarche before the age of 12 years 3 
months. 


(ii) The early maturers, with menarche between the ages of 12 years 3 months 
and 12 years 11 months. 


(iii) The remainder, among whom menarche occurred at or after the age of 
13 years, or who, being already aged 13 years, had not yet reached 
menarche. (There were two girls not yet at menarche who were aged 
between 12 years 9 months and 12 years 11 months, and these were 
included in this third group). 
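
The three categories can be expressed as a simple rule. The sketch below assumes that age at menarche is recorded in completed months and that a girl who has not yet reached menarche is recorded as `None`; the function and constant names are hypothetical.

```python
from typing import Optional

VERY_EARLY_LIMIT = 12 * 12 + 3  # 12 years 3 months, in months
EARLY_LIMIT = 13 * 12           # 13 years, in months


def menarche_category(age_at_menarche_months: Optional[int]) -> str:
    """Assign a girl to one of the three menarche categories defined in the text."""
    if age_at_menarche_months is None:
        # Menarche not yet reached at interview: counted with the remainder.
        return "remainder"
    if age_at_menarche_months < VERY_EARLY_LIMIT:
        return "very early"
    if age_at_menarche_months < EARLY_LIMIT:
        # 12 years 3 months up to 12 years 11 months.
        return "early"
    return "remainder"
```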


This third group, therefore, included both the median group and the late 
maturers. It was not practicable to distinguish these. There was, in the sample, 
a group of 358 girls among whom menarche occurred after the age of 13 years 
9 months. The mean scores of this group proved to be about five points of 
I.Q. below the average, but this was a spurious result arising from the nature 
of the sample. Where a sample is drawn from an educational year-group, there 
is a tendency for the brighter pupils to be younger than average and the duller 
pupils to be older. Younger pupils who were to prove late developers had not 
reached menarche at the time of interview. An attempt to establish a category 
of definite late developers from the available data would have missed these 
younger pupils and resulted in a group with a bias towards older age and lower 
ability. All pupils who were neither early nor very early maturers were, there- 
fore, grouped together. 


The same objection does not apply to the 'early' and 'very early 
maturers,' with whom menarche occurred before age 13 years or before 12 years 
3 months, respectively, as these groups could be identified with certainty. 
It is thus possible to derive results which would test the hypothesis that the 
early maturers would show a spurt in mental growth around the onset of 
puberty, but not to obtain information on the effect with late maturers. It is, 
however, the early maturers who are concerned in the argument about the 
influence of puberty on performance in selection tests at the age of 11. The 
hypothesis under test is that, if there is a spurt in mental development with the 
onset of puberty which is beginning to affect performance in testing at age 11, 
this would appear as a sudden improvement of the very early maturers at the 
age of 11—an improvement, that is, relative to any possible superiority that 
they may previously have shown over the early maturers and the remainder at 
the age of 7 and 9 before any effect of puberty has begun to operate. By age 13, 
on this hypothesis, the lead of the very early maturers over the earlies would 
have been reduced, as the earlies, and perhaps some of the remainder, came to 
benefit likewise from this pubertal spurt. 
The mean scores of the very early and early maturers and the remainder, 
which are set out in Table 2, do not show this pattern of results. The very early 
maturers are certainly superior at the age of 11, but this is not a sudden emergence 
of superiority with the onset of puberty. They are almost the same degree 
ahead at ages 7 and 9. In fact, the pattern of mean scores between ages 7 and 
11 is, if anything, the reverse of what would be expected on the hypothesis of a 
pubertal spurt among very early maturers at age 11. Differences between the 
means of the menarche categories at any one age are generally significant 
(the ‘very early’ at all ages over both ‘early’ and ‘remainder’; the ‘early’ 
only at ages 7 and 13 over ‘remainder’). 


TABLE 2 

MEAN TEST SCORES OF MENARCHE GROUPS. 

Mean Test Score at Age— | Menarche: Very Early 
7 | 103.5 
9 | 102.8 
11 (1) | 102.9 
11 (2) | 102.6 
13 | 108.3 
N | 246 
Percentage entering Senior Secondary Sch 



Beyond the age of 11, with the results at age 13, there is some suggestion 
that the early maturers are pushing ahead of the remainder. The differences 
between means at various ages for any one menarche category are not significant 
until we reach age 13: at this age, the drop in mean score of the ‘remainder’ 
category is significant. An alternative expression of this point is that the 
distance between the means of the ‘very early’ and ‘remainder’ categories 
increases significantly between 11 and 13. This may mark the beginning of a 
pubertal spurt among the very early maturers, though without a further 
testing at age 15 (now impracticable as many of the pupils have already left 
school) the data are inadequate to establish such a finding. However, it can be 
concluded from the present data that the earlier onset of puberty among girls 
is not seriously affecting test performance at age 11. 


It should, perhaps, be emphasized that within each menarche group there 
is a wide range of test scores at each age, the very early, early and remainder 
categories all including pupils whose scores range from the highest to the lowest. 


The relation of social class (assessed by father's occupation) to age at 
menarche and test score at various ages can be examined from the data. There 
is, of course, an association between social class and test score: the average 
score of girls from Social Class I was between 114 and 117 on all the tests, and the 
average score of those from Social Class V was between 93 and 95. Again, there 
was a wide range of scores in each category. The superiority of the very early 
maturers cannot, however, be explained as merely the product of class differ- 
ences. Even within any one social class, the very early maturers are consistently 
above the average for that class. The mean scores for each menarche category, 
separating the social class groups, are set out in full in Table 3. 

TABLE 3 

MEAN TEST SCORE AND MENARCHE, BY SOCIAL CLASS. 

Social Class | Menarche Category | Mean Score at Age— | 11 (2) 

Social class not stated: VE 4; E 5; R 23 








Sub-division by social class reduces the numbers in individual categories of 
the table, resulting in slightly larger chance fluctuations in the means at various 
ages. However, breaking down the data by social class produces no evidence to 
affect the conclusion derived from Table 2. Within each social class, the very 
early maturers generally have higher mean scores than the rest, but this appears 
at all ages and not merely with the onset of puberty. 

Some association between social class and age at menarche might reasonably 
have been expected. When the investigation was being planned, the physio- 
logically-minded among our advisers were inclined to predict that the early 
maturers would tend to come from the upper social groups, being taller, heavier 
and quicker to mature because of better nutrition. Some teachers, on the other 
hand, pointed out that girls from the poorer urban areas give the impression 
of earlier maturity in their appearance, dress, behaviour and interest in boys, 
and, in fact, they tend to marry earlier. Table 4, which presents the numbers 
and proportion in each menarche category for each social class, does not support 
either of these views: there is no marked association between age at menarche 
and social class in this sample. The proportion of very early and early maturers 
is nearly the same among daughters of professional and business men as among 
the daughters of unskilled labourers. 

The data for Table 4 were drawn from all those for whom information on 
both social class and age at menarche was available, that is, 1,385 girls on whom 
the previous tables are based, less thirty-two for whom a social class grading was 
not available, plus 211 girls for whom test records were incomplete (mostly from 
private schools and social classes I and II). A parallel analysis of the 1,385 
cases gives similar results. 

TABLE 4 

DISTRIBUTION OF MENARCHE BY SOCIAL CLASS. 

Menarche Category | I/II | IIIa | III | IV | V | Total 

Very early | 50 (16) | 49 (21) | 119 (18) | 24 (13) | 31 (16) | 273 (17) 
Early | 67 (22) | 44 (19) | 131 (20) | 48 (26) | 46 (24) | 336 (21) 
Remainder | 189 (62) | 135 (59) | 404 (62) | 110 (60) | 117 (60) | 955 (61) 
Total | 306 (100) | 228 (100) | 654 (100) | 182 (100) | 194 (100) | 1,564 (100) 

(Percentages in brackets.) 


Chi-square on this table is 8.6, with 8 degrees of freedom, and is not significant 
(P about 0.4). 
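
The chi-square value can be checked from the frequencies in Table 4. The following is a minimal sketch in plain Python: the rows are taken as the very early, early and remainder categories, and the columns as the five social class groups. The function names are hypothetical, and the closed-form tail probability used here holds only for an even number of degrees of freedom.

```python
import math

# Frequencies from Table 4 (rows: very early, early, remainder;
# columns: the five social class groups).
observed = [
    [50, 49, 119, 24, 31],
    [67, 44, 131, 48, 46],
    [189, 135, 404, 110, 117],
]


def chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for a two-way table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, count in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (count - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof


def upper_tail_even_dof(x, dof):
    """Upper-tail chi-square probability; closed form valid for even dof only."""
    half = x / 2.0
    return math.exp(-half) * sum(half ** k / math.factorial(k)
                                 for k in range(dof // 2))


statistic, dof = chi_square(observed)
p_value = upper_tail_even_dof(statistic, dof)
```

On these frequencies the statistic agrees with the reported value to one decimal place, and the probability is close to the reported figure.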


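The inter-correlations between the test scores at the various ages were as 
follows : 

Between score at age 7 and score at age 9: +0.69 
Between score at age 7 and score at age 11 (1): +0.72 
Between score at age 7 and score at age 11 (2): +0.75 
Between score at age 7 and score at age 13: +0.74 

Between score at age 9 and score at age 11 (1): +0.82 
Between score at age 9 and score at age 11 (2): +0.82 
Between score at age 9 and score at age 13: +0.82 

Between score at age 11 (1) and score at age 11 (2): +0.93 
Between score at age 11 (1) and score at age 13: +0.90 

Between score at age 11 (2) and score at age 13: +0.91 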


IV.—CONCLUSIONS. 


The results from this analysis of the test scores of 1,385 girls at ages 7, 9, 
11, and 13, show that the anxiety that earlier onset of puberty is affecting 
performance in selection tests at age 11, is unfounded. On other aspects of the 
effect of puberty on mental development, the data do not permit any definite 
conclusions to be reached. The results at age 13 can be interpreted as indicating 
the beginning of a possible later spurt. It is also possible (some may say 
probable) that the onset of puberty affects aspects of mental development 
which are not measured by the tests used. Moreover, the investigation was 
concerned principally with a comparison of the early and very early maturers 
with the remainder, and the data were not appropriate for a study of the late 
maturers. Whatever the causes of the earlier onset of puberty, the evidence so 
far does not show any relationship between early onset of puberty and social 
class. Other aspects of this investigation, including family size, migration, type 
of secondary school attended, area of residence and birth weight, will be 
discussed in a subsequent paper. 

THE MEASUREMENT OF LEARNING ABILITY 


By G. W. S. MACKAY AND P. E. VERNON 
(University of London, Institute of Education) 


Summary.—Intelligence tests are often assumed to predict a child's ability to 
learn, although they are seldom based on situations that directly involve any 
learning. A battery of nine group tests was devised for junior school children, 
each based on the learning of an unfamiliar task, and yielding six scores for 
successive stages of practice. These were applied to 8-9 and 10-11 year pupils, 
along with a number of more conventional tests, in order to find whether they 
added appreciably to the predictions of achievement derived from the previous 
year's work. As a measure of learning, residual gain was used, i.e., final scores 
holding initial scores constant. 

These residual gain measures were found to be largely specific in the younger 
group, though somewhat more consistent and more predictive in the older 
group. However, the tests differed considerably in their validities, and further 
work might produce more useful ones. There are considerable difficulties in 
finding tasks relevant to school progress which will also cover a wide range of 
ability. A stronger case might be made for using final scores, after a period of 
instruction and practice on new tests, as predictors. 
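
Residual gain, as defined above, can be illustrated by regressing final scores on initial scores and taking what is left over. The sketch below is one ordinary least-squares reading of "final scores holding initial scores constant"; the authors' own computational procedure is not given here and may have differed. The function name and the scores are hypothetical.

```python
from statistics import mean


def residual_gains(initial, final):
    """Final score minus the final score predicted from the initial score."""
    mean_x, mean_y = mean(initial), mean(final)
    sxx = sum((x - mean_x) ** 2 for x in initial)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(initial, final))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return [y - (intercept + slope * x) for x, y in zip(initial, final)]


# Hypothetical initial and final scores for five pupils.
initial = [4, 6, 7, 9, 12]
final = [9, 10, 14, 13, 19]
gains = residual_gains(initial, final)
```

By construction these residuals average zero and are uncorrelated with the initial scores, which is what distinguishes them from crude gains (final minus initial).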


I.—INTRODUCTION. 


THE main object of giving intelligence tests to children in school, or at an 
educational clinic, is to assess their capacity to make scholastic progress. This 
implies that the tests predict ability to learn ; and, indeed, several writers have 
defined intelligence as learning ability (Buckingham 1921, Dearborn 1921). 
If, however, we regard learning as changes in performance following instruction 
and practice, then, with Woodrow (1946), we “cannot simply state that improvement 
with practice is identical with some hypothesized intelligence.” 
Much the same point is made by Simrall (1947), who adds that there is little or 
no evidence that gains due to practice on any one test are associated with gains 
on other tests; i.e., there is no such thing as generalized learning ability. 
Intelligence is normally measured by tests which are given once only, and which 
provide no opportunity for practice or the acquisition of new mental skills. 
Hence, they bear very little relation to the amount of progress children are 
likely to make in their school work over a given period, say a school year. And 
yet they still correlate highly with future overall scholastic achievement. 
J. E. Anderson (1940) has pointed out the main explanation of this apparent 
paradox, namely that mental development is always cumulative. A child 
starts the year with a fund of information and mental skills, X, which are very 
similar to those involved in doing an intelligence test, I; during the year he 
acquires further skills, Y. Now the correlation of I with final achievement, 
X+Y, is still quite large because $r_{IX}$ is large, although the correlation $r_{IY}$ 
may be very small or even zero. 
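
Anderson's point follows from the standard formula for the correlation of a variable with a sum, $r_{I(X+Y)} = (r_{IX}\sigma_X + r_{IY}\sigma_Y) / \sqrt{\sigma_X^2 + \sigma_Y^2 + 2 r_{XY}\sigma_X\sigma_Y}$. The sketch below evaluates it for purely illustrative values; none of the figures comes from Anderson or from the present study, and the function name is hypothetical.

```python
import math


def correlation_with_sum(r_ix, r_iy, r_xy, sd_x, sd_y):
    """Correlation of I with the composite X + Y."""
    covariance_term = r_ix * sd_x + r_iy * sd_y
    sd_of_sum = math.sqrt(sd_x ** 2 + sd_y ** 2 + 2 * r_xy * sd_x * sd_y)
    return covariance_term / sd_of_sum


# Illustrative assumption: I correlates 0.8 with prior attainment X, not at all
# with the year's gain Y, and X varies twice as much as Y.
r_total = correlation_with_sum(r_ix=0.8, r_iy=0.0, r_xy=0.0, sd_x=2.0, sd_y=1.0)
```

Under these assumed values the correlation with final achievement remains above 0.7 even though the test is entirely unrelated to what is learned during the year.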


Surely, it should be possible, then, to devise tests which actually involve 
learning in a controlled situation, and which might add to the predictions of 
later performance already covered by intelligence tests. Such tests should be of 
considerable value to the psychologist and/or the teacher. Furthermore, they 
should provide even better opportunities than current intelligence tests for 
observing and assessing children's methods of tackling difficulties, their 
attention, perseverance, etc.—qualities which are highly relevant to future 
success or failure. 

II.—PREVIOUS INVESTIGATIONS. 


A considerable amount of work on the factors involved in various types of 
learning has been carried out, though little of it has much bearing on scholastic 
achievement. Anastasi (1930, 1932), Thurstone (1938) and others, have demon- 
strated the existence of rote memory factors in the learning of paired associates, 
words, digits, etc. ; for the most part, these seem to be fairly specific to the type 
of material and the conditions of presentation and recall. There is no evidence 
that they correlate with school learning, except in so far as they involve g or 
verbal ability. Vernon (1961) claims, in his review of the field, that logical 
memory or the learning of meaningful materials depends wholly on these 
familiar factors, i.e., that no special faculty of meaningful learning can be 
distinguished. 

Another approach is represented by the investigations of Melton (1947) and 
Fleishman and Hempel (1954, 1955) into psychomotor learning, where measures 
of proficiency at different stages of practice were factorized, and changes in 
factor patterns were obtained as the learners became more proficient. In the 
early stages of practice at complex co-ordination or discrimination reaction 
tasks, there were considerable correlations with spatial, mechanical and other 
cognitive factors, suggesting that learners are tackling the task by means of their 
already established abilities. But at later stages there is less dependence on 
these familiar factors, and other motor factors become more prominent, or a 
factor specific to the particular task emerges. This explains why it is so difficult 
to predict ultimate motor skills either by paper-and-pencil aptitude tests or from 
success during the early stages of learning. A similar investigation of learning a 
short musical theme was published by Burroughs and Morris (1962). 


Allison (1954, 1956) and Mollenkopf (1957), as part of a programme for 
selecting U.S. Navy personnel, designed performance tests involving the learning 
of mechanical tasks, in which subjects were instructed in assembling various 
models, and tested after each period of instruction. They found that these 
learning scores predicted achievement at the end of mechanical training at 
least as well as the Navy's Basic Battery of aptitude tests, and that they were 
capable of adding appreciably to the predictive efficiency of this Battery. 
Although no similar investigation has been carried out with scholastic materials, 
these results seem to offer some promise that observation and measurement of a 
pupil's progress in a learning situation might likewise improve on the prediction 
of his future achievement. G. R. Ortar (1960) was faced with the educational 
selection and guidance of Israeli children from very varied cultural and linguistic 
backgrounds. Finding that scores on standard tests were seriously affected by 
such differences, she introduced a period of coaching and, on retesting, obtained 
much improved validities in predicting school achievement. She did not, how- 
ever, make use of measures of the gain brought about by coaching. 


The most quoted studies of actual learning gains are those of Woodrow 
(1938, 1939), who practised two groups of subjects repeatedly on a number of 
simple tasks—cancellation, arithmetic, anagrams, length estimation, etc. 
Here also the factor content of the initial and final scores differed considerably, 
and there was little evidence of any general factor among the gain scores, nor of 
overlapping between the amount of improvement and intelligence. Neverthe- 
less, gain scores on different types of task, e.g., numerical, perceptual, did show 
some positive correlation with N and P factors. One reason for Woodrow's 
negative results may have been that the tasks employed were of a rather simple 
kind, and Tilton (1953) claims considerably greater consistency in an investiga- 
tion of school subject learning. Here, standardized achievement tests in eight 
subjects were given to fourth grade pupils, and repeated twenty months later. 
There were marked positive correlations, averaging 0.30 (or 0.51 when corrected 
for attenuation) between the gain scores, indicating a prominent general factor 
in school progress. Gains also correlated positively with initial achievement ; 
i.e., those who were more able in any subject tended to progress better in it 
than those who were initially weak. 
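
The correction for attenuation mentioned above divides the observed correlation by the square root of the product of the two reliabilities. The sketch below applies this standard formula; the reliability figure is not reported in the text and is merely back-computed here on the assumption that the gain scores were equally reliable.

```python
import math


def correct_for_attenuation(observed_r, reliability_a, reliability_b):
    """Estimate the correlation between true scores from an observed correlation."""
    return observed_r / math.sqrt(reliability_a * reliability_b)


# Assumed, not reported: the common reliability implied by 0.30 rising to 0.51.
implied_reliability = 0.30 / 0.51
corrected = correct_for_attenuation(0.30, implied_reliability, implied_reliability)
```

On this assumption the gain scores would have had reliabilities of a little under 0.6, which shows how much of the low observed correlation may be due to unreliable measurement of gains.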


More recently, two large-scale investigations have been reported from the 
Educational Testing Service, Princeton, N.J. Stake (1958) applied a dozen 
learning tasks, each involving several trials, to 240 seventh grade pupils. 
Both verbal and non-verbal, rote and meaningful, tasks were used, and an 
extensive battery of reference tests was added. Each learning task was scored 
for total errors or overall success, for slope of the learning curves, and for 
regularity of progress. The last set of measures yielded mostly negligible 
correlations, but the success measures correlated positively with intelligence and 
with scholastic achievement. On factorizing his complete battery, Stake con- 
cluded that there is no general learning ability, but a number of factors for 
different kinds of tests ; also that there is little relation between slope or rate of 
learning and familiar psychological factors. 


Allison's (1960) study was similar, but it included a greater variety of 
learning tasks (psychomotor, mechanical, rote learning and concept formation), 
which were given to 315 adult recruits. These were scored for average rate of 
progress, and for faster or slower progress in the second than in the first half of 
trials. The main factors among these measures included: (1) Conceptual learning; 
(2) Mechanical and motor learning; and (3) Rote learning. On comparing 
these with the factors established in a battery of thirty-nine reference tests, he 
observed, unlike Stake, that No. 1 was related to general verbal intelligence and 
number ability, No. 2 to a spatial-mechanical factor, and No. 3 to a rote memory 
factor. 

It would seem then that learning progress is certainly not identical with 
intelligence, though there may be some positive overlapping when complex 
learning tasks are studied. There is little evidence, also, of a general learning 
ability, common to all types of task, though this too may be affected by the level 
of complexity of the tasks. Rather, learning measures tend to group under a 
number of rather ill-defined factors, according to the type of material and the 
conditions of learning and recall. Some of these group factors do seem to relate 
to ability factors, for example verbal learning correlating more highly with 
verbal ability, non-verbal learning with spatial ability, etc. 


III.—THE PRESENT EXPERIMENT. 


Our own small-scale experiment was conducted against this background 
with two groups of approximately thirty-six primary school children, aged 8-9 
years and 10-11 years, being the complete unstreamed second and fourth years 
of a school in a good middle-class area. A study with these numbers can be 
regarded only as a first pilot exploration ; but it may suggest ways of tackling 
a larger sample with more refined techniques if worthwhile results are indicated. 


The aim was to devise a series of group learning tests which would approxi- 
mate more closely to the normal school learning situation than those employed 
by Stake. Group tests were preferred, not only because the tests would in- 
evitably be too lengthy for individual application, but also because children 
might well learn differently when individually instructed from the way they 
learn in class. Each test was completed in a school period; i.e., long-term 
learning and retention were not involved. All tests were given during January, 
1961, by the first author who, as head master, had often conducted classes with 
these children. In addition, a series of conventional ‘ reference ’ tests of verbal, 
numerical and spatial abilities and scholastic attainments was given, partly 
with the aim of exploring the factorial content of the new learning tests, partly 
in order to find whether the learning tests added significantly to conventional 
tests in predicting achievement. The final criterion of achievement consisted of 
a combination of the class teacher’s estimates of progress made during the 
remainder of the school year, and standardized English and arithmetic tests. 

An inherent difficulty in devising learning tests is that the more closely they 
parallel school learning, the more they are likely to be affected by previous 
experience and achievement. Another problem is to ensure that they will 
cover a wide enough range of ability to be useful throughout the junior school. 
Ideally, a dull 8 or even 7-year-old should be able to make a start with them 
before he receives practice ; while at the same time a bright 11-year-old should 
still be improving at the end of practice. A description of the tests will show how 
these difficulties were tackled. Details may be found in Mackay (1962). 


(1) Language Learning Test. This test attempted to teach the children a 
new language that they could translate from or into. Esperanto was used 
because of its inherent logical structure, and because it seemed unlikely that any 
children would be familiar with it. The test was a printed one and was tackled 
without any time limit, although it became clear that after about 40 minutes, 
the children had reached their limit. The initial instructions were not standard- 
ised ; they were designed to reduce anxiety by giving the impression that this 
was a normal lesson, not a formal test. But once the nature of the task had been 
pointed out, the children carried on by themselves from printed instructions. 

The first exercise showed a few easy English words with their Esperanto 
equivalents. Those carrying the definite article were prefixed with La, those with 
the indefinite article merely had the equivalent without La. This exercise 
expected the children to grasp this rule, and they had to use it in translating 
the given words. The rule was then carried forward to the next piece of informa- 
tion, and so on until sentences could be translated using the rules governing the 
pronouns, verb tenses, plurals, agreement of adjectives with nouns, etc. Finally, 
a complete piece of translation could be, and was, attempted by the children. 
In order to give an initial, final, and total score, the first third, last third, and 
total number of answers were marked separately. 


(2) Number Substitution Learning. This test was similar in structure to the 
Language Learning. A simple form of algebra unfamiliar to the children was 
used. Letters were given equivalent numbers, and after some straightforward 
interpolation, additional rules were supplied involving addition, subtraction, 
multiplication and division, which, although familiar to the children, did depend
upon previous experience. It seemed impossible to deal with numbers in any 
complex way without assuming a knowledge of these processes. The rules, 
however, had to be applied in a way that was not familiar, and it was the inter- 
pretation and progressive learning of the rules that was primarily involved. 


(3) Verbal Instructions Learning. This test was given six trials, as was 
every subsequent test. The children were provided with six printed sheets, 
each sheet containing eight rows, each row consisting of two numbers and two 
letters. The tester read the instructions for all eight rows: e.g., put a ring 
round number seven and cross out letter A. After hearing the complete list, 
the children answered as many items as they could on their first paper. The 
full instructions were read again, and a second attempt was made with a new 
sheet. This was done six times.


(4) Verbal Information Learning. The above test involved rather meaningless
material. This test presented a kind of learning more similar to what children
meet at school. Twenty-five pieces of information were read out; these were
simple, but unlikely to be known by any of them: e.g., Bach died in 1750;
the first five letters on an English typewriter are QWERT. After a first reading
of the complete list, the children answered twenty-five questions based directly
on the information. Six trials were given, and the order of asking the questions
(not of reading the statements) was altered each time.


(5) Word Memory. A list of sixteen three- or four-letter words was read
out to the children, and they were given two minutes to write down as many as 
they could remember. (In practice they were allowed as much time as they 
needed). This was repeated six times. 


(6) Perceptual Detail Learning. The above tests covered the kind of
material the children learn in the verbal and number aspects of their school
work. This test and the two following attempted to see how they learn using
visual cues. The children were briefly shown a large card with twelve circles
representing faces. Each face was incomplete in some detail of eyebrows,
eyes, nose, and mouth. No other features were shown, and those that were 
shown used only dots or straight lines ; no drawing skill was involved. Having 
looked at the twelve faces for eight seconds, they then completed a sheet con- 
taining twelve blank circles. This test was repeated six times. 


(7) Geometrical Form Learning. This test presented a large, rather com- 
plex, geometrical figure, built up from squares, circles, triangles, diagonals, 
etc., twenty distinct points in all. After looking at the figure for eight seconds, 
the children attempted to draw this figure. Again, six trials were given. 


(8) Symbol Maze Learning. Some kinaesthetic learning was added to the 
learning of visual patterns. Each child had a printed maze, and the class was 
shown eighteen symbols that could not be crossed on the maze. After seeing the
forbidden symbols for eight seconds, the maze was attempted. Six trials were 
involved. 


(9) Group Concept Formation Test. Although only one test of this type 
was included in the battery, we feel that there is much to be learned from a 
battery of tests exclusively of the concept-formation type, some being individ- 
ual. Bruner, Piaget, Annett (1959) and others have indicated the importance 
of this aspect of learning, and further studies are clearly needed. This test 
involved the linking of nonsense syllables with pictures. The first example 
showed a horse (TIX), a dog (TIX), a man (FON), and a woman (FON).
The children were then shown (a) a baby and (b) a cat, and asked to write the 
appropriate syllables after each. There were fourteen trials in all, some being 
repeats, with the complexity of the concepts increasing. Concepts in the middle 
range included things that fly, things that make a sound, and people in uniform ; 
and, at the end of the scale, things nocturnal, physical action, and modes of
transport. In the older group at least it was clear that children had benefited 
from experience with the early, easy, concepts when tackling the later, difficult 
ones. 

Reference Tests. The remaining tests in the battery were as follows:

(10) A forty-item, multiple-choice vocabulary test.

(11) A reading comprehension test, similar in form to the Watts-Vernon test.

(12) Schonell's Essential Mechanical Arithmetic.

(13) Schonell's Essential Problem Arithmetic.

(14) A twenty-item number series test.

(15) Digit memory, fourteen items ranging from three to nine digits; this was
given orally and responses were written.

(16) A spatial test involving the copying of printed figures onto a dotted 
space. 

(17) Goodenough Draw-a-Man test. 

(18) Bender-Gestalt, given as a group test with each figure shown on a large 
card. 

(19) Overall school attainment had already been examined in 1960, and 
similar tests were given in the Summer of 1961 to provide a criterion of 
later achievement. 


IV.—SCORING.


Mollenkopf and Allison seem to have used the total scores on all trials as 
their measure of learning. While these obviously reflect improvement, they 
depend very largely also on initial ability. Woodrow, on the other hand, took 
final-minus-initial scores, that is the absolute gain. The disadvantage here is 
that progress on most tasks is unlikely to be linear; thus, there can be no 
guarantee that the units are equal at different parts of the scale. For example, 
it might be easier for those who start with a score of 10 to finish with a score of 
30 than for those who start at 40 to finish with a score of 50. Stake plotted 
individual learning curves for each task, but eventually employed three meas- 
ures for each curve, namely total success, slope or curvature (i.e., a measure of 
gain), and goodness of fit (i.e., regularity of progress). Since the latter measures 
yielded few significant correlations with any other tests, we decided to study 
total score, initial score in the first two trials and final score in the last two
trials (the first and last trials alone would have yielded bigger differences, but would
also be likely to be poorer in reliability). However, the amount of progress was
determined by correlational analysis, as follows, not by subtracting initial from final
scores.

Manning and DuBois (1958) argue that the most appropriate measure of
learning is "that portion of the measure of final status which is not correlated
with initial status." Thus, if we denote initial (or early) performance as A,
final performance as Z, and compare these with some criterion, C, or with any
other test, then the correlation of effective gain with C will be:*


\[ r_{(Z-A)C} = r_{ZC} - r_{AC}\, r_{AZ} \]
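
One way to see where this expression comes from is sketched below. It assumes that all scores are expressed as standard scores (written here as $z_A$, $z_Z$, $z_C$, a notation introduced only for this illustration) and that "gain" means the residual of final status after regression on initial status. This is a reading of the Manning and DuBois definition, not a derivation given in the text.

```latex
\begin{align*}
G &= z_Z - r_{AZ}\, z_A
  && \text{(residual of final status on initial status)} \\
\operatorname{cov}(G, z_C) &= \operatorname{cov}(z_Z, z_C) - r_{AZ}\,\operatorname{cov}(z_A, z_C)
  = r_{ZC} - r_{AC}\, r_{AZ} \\
\operatorname{var}(G) &= 1 - r_{AZ}^{2}
\end{align*}
```

On this reading the expression is a covariance rather than a full correlation, since the residual no longer has unit variance. The difference grows as $r_{AZ}$ approaches 1.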

Manning and DuBois do not indicate how one should correlate gain scores 
on one test with those on another. But we can extend the above formula 
to Tests 1 and 2 as follows : 

\[ r_{(Z_1-A_1)(Z_2-A_2)} = r_{Z_1Z_2} + r_{A_1A_2}\, r_{A_1Z_1}\, r_{A_2Z_2} - r_{A_1Z_2}\, r_{A_1Z_1} - r_{A_2Z_1}\, r_{A_2Z_2} \]
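
The two expressions above can be checked numerically. The sketch below is illustrative only: the function names are hypothetical, the scores are invented (36 simulated pupils, not data from the study), and it assumes that gain is the residual of the standardized final score on the standardized initial score. Under that assumption the mean product of two residuals should agree with the formulae up to rounding error.

```python
import random


def pearson(x, y):
    """Product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def zscores(x):
    """Standardize to mean 0 and (population) standard deviation 1."""
    n = len(x)
    m = sum(x) / n
    sd = (sum((a - m) ** 2 for a in x) / n) ** 0.5
    return [(a - m) / sd for a in x]


def gain_with_criterion(r_zc, r_ac, r_az):
    """Uncorrected gain-criterion coefficient: r_ZC - r_AC * r_AZ."""
    return r_zc - r_ac * r_az


def gain_with_gain(r_z1z2, r_a1a2, r_a1z1, r_a2z2, r_a1z2, r_a2z1):
    """Uncorrected gain-gain coefficient for two tests."""
    return (r_z1z2 + r_a1a2 * r_a1z1 * r_a2z2
            - r_a1z2 * r_a1z1 - r_a2z1 * r_a2z2)


def residual_gain(initial, final):
    """Standardized final score minus the part predicted from the initial score."""
    r_az = pearson(initial, final)
    return [z - r_az * a for a, z in zip(zscores(initial), zscores(final))]


def mean_product(x, y):
    """Mean of the products of paired values (a covariance when both means are 0)."""
    return sum(a * b for a, b in zip(x, y)) / len(x)


# Invented scores for 36 hypothetical pupils; NOT data from the study.
rng = random.Random(0)
ability = [rng.gauss(0, 1) for _ in range(36)]
a1 = [g + rng.gauss(0, 1) for g in ability]                  # test 1, initial
z1 = [a + g + rng.gauss(0, 1) for a, g in zip(a1, ability)]  # test 1, final
a2 = [g + rng.gauss(0, 1) for g in ability]                  # test 2, initial
z2 = [a + g + rng.gauss(0, 1) for a, g in zip(a2, ability)]  # test 2, final
crit = [g + rng.gauss(0, 1) for g in ability]                # criterion

direct_c = mean_product(residual_gain(a1, z1), zscores(crit))
formula_c = gain_with_criterion(pearson(z1, crit), pearson(a1, crit),
                                pearson(a1, z1))

direct_12 = mean_product(residual_gain(a1, z1), residual_gain(a2, z2))
formula_12 = gain_with_gain(pearson(z1, z2), pearson(a1, a2), pearson(a1, z1),
                            pearson(a2, z2), pearson(a1, z2), pearson(a2, z1))

print(abs(direct_c - formula_c) < 1e-9, abs(direct_12 - formula_12) < 1e-9)
```

The direct computation needs the raw scores, whereas the formulae need only the correlation matrix, which is presumably why the gain coefficients could be derived by hand from the computed matrix.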

Each of the nine learning tests yielded a total, an initial and final score in 
each of the two groups. With nine reference tests, 1960 attainment, 1961 
estimates and 1961 attainment, this means thirty-nine variables in each group. 
The 1,482 product-moment correlations were kindly calculated by the London 
University Computer Unit,† and the gain score r's were derived from these by
hand, according to the above formulae. The correlations of each measure with 


* Presumably this expression should be corrected, as in the ordinary formula for
partial correlation, by being divided by $\sqrt{(1-r^2_{AC})(1-r^2_{AZ})}$. However, it seemed better
to omit this step, since it resulted in boosting many of the Gain score correlations (because
$r_{AZ}$ was usually very large). In other words, we followed the same procedure as in
calculating the residuals in a centroid factor analysis.


† These may be borrowed on request to the authors.

the two 1961 (criterion) variables were generally very similar; hence they were
combined by the Correlation of Sums formula. The resulting validities are shown in
Table 1.
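
The variable and correlation counts quoted above can be reproduced with a short calculation. The sketch below uses only figures stated in the text; the variable names are illustrative.

```python
from math import comb

learning_tests = 9
scores_per_learning_test = 3   # total, initial and final score
reference_tests = 9            # tests (10) to (18)
attainment_measures = 3        # 1960 attainment, 1961 estimates, 1961 attainment
groups = 2                     # 8-9 and 10-11 year olds

variables_per_group = (learning_tests * scores_per_learning_test
                       + reference_tests + attainment_measures)

# Each pair of distinct variables gives one product-moment correlation.
correlations_per_group = comb(variables_per_group, 2)
total_correlations = groups * correlations_per_group

print(variables_per_group, correlations_per_group, total_correlations)
# prints: 39 741 1482
```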
TABLE 1 


CORRELATIONS OF LEARNING TEST GAIN AND TOTAL SCORES AND REFERENCE TESTS WITH 
ATTAINMENT CRITERION. 


                     |   Gain Scores    |   Total Scores       |
                     |  8-9   |  10-11  |  8-9   |  10-11      | Reference tests (values not legible)
Language Learning    |  .163  |  .201   |  .810  |  .598       |
Number Substitution  |  .011  |  .547   |  .655  |  .787       | Reading Compr.
Verbal Instructions  |  .256  |  .206   |  .563  |  .189       | Mechanical Arith.
Verbal Information   |  .199  |  .203   |  .762  | (illegible) | Problem Arith.
Word Memory          |  .112  |  .276   |  .770  | (illegible) | Number Series
Perceptual Detail    |  .306  | -.042   |  .517  |  .198       | Digit Memory
Geometrical Form     |  .134  |  .361   |  .317  |  .102       | Spatial Test
Symbol Maze          |  .150  |  .332   |  .465  |  .367       | Goodenough Draw.
Concept Formation    |  .065  |  .251   |  .535  |  .574       | Bender-Gestalt
                     |        |         |        |             | 1960 Attainment











V.—RESULTS. 


It may be seen that the validities of the Total scores are similar to those of 
the Reference tests, though rather variable. None is as predictive as all-round 
attainment at the beginning of the year. By contrast, the Gain score validities 
are low; yet all but one are positive. Note that there is no consistency here 
between the two age groups, suggesting that the tests with near-zero validities 
are those which were too difficult for one group to register much progress 
(e.g., Number Substitution for 8-9 years), or too easy for the other (Perceptual 
Detail for the 10-11 years). In any case the Standard Errors of correlations 
based on thirty-six cases are so large that inferences as to the worth of particular 
tests would be unsafe. For the same reason, neither multiple regression analysis 
nor factor analysis seemed likely to help in elucidating the results. Instead, we 
condensed the correlation matrices by Correlation of Sums formula to yield the 
correlations among groups of similar tests. These are shown in Table 2 (8-9 year 
above, and 10-11 year below, the diagonal). 
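
The text does not write out the Correlation of Sums formula. A minimal sketch of the usual form follows, assuming unit-weighted sums of standardized scores (an assumption; the weighting actually used is not stated). The function name and the small correlation matrix are invented for illustration.

```python
from math import sqrt


def correlation_of_sums(R, group_a, group_b):
    """Correlation between two unit-weighted composites of standardized variables.

    R is a full correlation matrix (list of lists, 1.0 on the diagonal);
    group_a and group_b are disjoint lists of row/column indices.
    """
    cross = sum(R[i][j] for i in group_a for j in group_b)
    # Variance of each composite: sum of all entries of its own sub-matrix.
    var_a = sum(R[i][j] for i in group_a for j in group_a)
    var_b = sum(R[i][j] for i in group_b for j in group_b)
    return cross / sqrt(var_a * var_b)


# Invented example: one test (index 0) against two criterion measures (1, 2).
R = [
    [1.0, 0.6, 0.5],
    [0.6, 1.0, 0.8],
    [0.5, 0.8, 1.0],
]
validity = correlation_of_sums(R, [0], [1, 2])
print(round(validity, 3))
```

With a single variable in each group the function simply returns the corresponding entry of the matrix, which is a convenient check on any hand calculation.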


TABLE 2 


INTER-CORRELATIONS AMONG COMBINED GROUPS OF VARIABLES.

[Table values not recoverable. Variables: Verbal Gain, Non-Verbal Gain, Verbal Total,
Non-Verbal Total, Verbal Ability, Number Ability, Spatial Ability, 1960 Attainment,
1961 Criterion.]

All the Gain score correlations with other variables are noticeably lower in
the 8-9 year than the 10-11 year group, indicating poorer reliability and greater
specificity among younger children. Even the combined Verbal Gain measures
(Nos. 1-5) correlate only 0.131 with the combined Non-verbal measures (Nos.
6-9), as against 0.491 in the older group. In contrast the two sets of Total scores
show very similar relations to other variables in the two groups (except to the 
non-verbal reference tests ; the Goodenough, Bender and Space test had very 
little in common among 10-11 year olds). Thus, overall performance at the new 
tests is quite predictive in the second year of the junior school, but children of 
this age do not show as consistent an ability to make steady progress at un- 
familiar tasks. 


Secondly, it may be noted that Gain scores tend to correlate more highly 
with Totals on the same tests than with any other variables. Children with good
ability in any task do tend to progress more rapidly at this or similar tasks.* 
The same point is brought out by the mean correlations of the nine Gain scores 
with:


                                        8-9     10-11

Other Gain scores                       .051    .141
Own Total scores                        .375    .472
Other Totals                            .113    .174
Reference tests and other variables     .111    .215


It would follow that rate of progress on any one task will not tell the teacher 
or psychologist much about performance on any other task, particularly with 
younger children. At the same time, Woodrow, Stake and other writers are not 
justified in regarding learning ability as almost wholly specific. For example, 
the mean inter-correlation of 0.141 among 10-11 year olds implies that our
battery of Gain scores would correlate 0.60 with another similar battery; and
doubtless better tests could be devised. Moreover, a certain amount of patterning
among the Gain scores may be observed in Table 2. In both groups the
Non-Verbal Gains correlate more highly than do the Verbal Gains with the
spatial-pictorial reference tests. However, the Totals for the learning tests 
show this patterning more consistently. 
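
The figure of 0.60 is consistent with the standard formula for the correlation between two parallel composites, which has the same form as the Spearman-Brown formula. The text does not name the formula used, so the sketch below is an assumption: it takes nine Gain scores per battery and the same mean inter-correlation within and between batteries. The function name is illustrative.

```python
def battery_correlation(mean_r, n_tests):
    """Expected correlation between two similar batteries of n_tests each,
    when every pair of tests (within or across batteries) correlates mean_r."""
    return n_tests * mean_r / (1 + (n_tests - 1) * mean_r)


older = battery_correlation(0.141, 9)   # 10-11 year group, nine Gain scores
print(round(older, 2))  # prints 0.6
```

The same function applied to the mean of 0.051 for the 8-9 year group gives a much lower value, in line with the remark that rate of progress is more specific among younger children.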


Prediction of Attainment. The matrices were now further condensed by
combining all Gain scores, all Totals, and all Reference tests (but omitting the
Perceptual Detail Learning and Goodenough in the 10-11 group). Multiple 
correlations with the 1961 criterion were calculated, and the beta weights were 
as follows : 





                                8-9     10-11

Gain scores combined           -.112     .093
Total scores combined           .546    -.003
Reference tests                 .231     .198
1960 Attainment                 .285     .613
Multiple R                      .939     .845


* This might be expected from the derivation of the Gain scores, since the correlation
of Gain with Total = $r_{ZT} - r_{AT}\,r_{AZ}$, and this expression can be zero or negative only if
$r_{AT}$ exceeds $r_{ZT}$. In the light of Fleishman's work, it seems unlikely that a person's
initial performance could more closely resemble his overall success than does his final
performance.

The results differ markedly for the two groups. In the 10-11's, the previous
year's attainment is much the best predictor. The learning test Total adds
nothing, but Reference tests cover some more ground, and Gain scores make a
very small additional contribution. In the younger group the Gain scores add
nothing to the other predictors ; indeed, their beta weight is negative. Previous 
year's attainment is somewhat less diagnostic than at 10-11, but both the Learn- 
ing Test Totals and the Reference tests do contribute appreciably. 


One other possibility is that the Final scores on the learning tests would 
not only be much more convenient than any form of Gain score but might also 
be the most predictive type of score. In so far as both the Total score and rate of 
progress must depend to some extent on the rather chancy initial reaction to an 
unfamiliar situation, it might be argued that the performance reached after a 
certain amount of practice at the new task would be a better indicator of 
scholastic capabilities. This is borne out among the 10-11 year pupils, the 
correlation of combined Final scores with the criterion being 0.705, which is
higher than 0.661 for the combined Total scores. Note, however, from Table 2
that it is still below the validities of the combined Reference tests and of 1960 
attainment, though it could probably be considerably improved if the more 
promising learning tests only were used. In the younger group the Final scores 
all give lower correlations than the Total scores with the criterion, though here 
too, some of the tests seem promising, e.g., Language Learning 0.797, Verbal
Information 0.719.


VI.—CONCLUSIONS. 


Our search for tests involving actual learning which might be more predictive 
of future scholastic achievement than ' cross-sectional ' tests has proved some- 
what disappointing, particularly with children in the second junior year, 
though more successful among older children. Apparently, such tests need to be 
graded carefully to the level of ability of the particular group of pupils, otherwise 
the amount of progress made in a limited time is highly unreliable and specific. 
However, it is not true, as some writers have maintained, that there is no 
consistency. When Residual Gain was measured, a considerable general learning
factor appeared among the older children, and there was some indication of 
group factors for learning different types of materials—verbal, numerical, 
non-verbal, etc. Each Residual Gain score also correlated relatively highly with 
overall success at that particular task. 


The predictive value of the various learning tests that were tried out varied 
widely, yielding the expectation that a battery of successful ones could be devised; 
though much more exploration with larger numbers of cases is needed. Finally, 
it would appear that rate of progress or gain scores possess no inherent advantage.
It is the final scores achieved after instruction and practice at some unfamiliar
task that yield the highest validities, and seem most likely to add to the
predictive value of conventional tests that are applied without such practice.


Useful qualitative observations can be made during the tests of pupils' 
methods of work, concentration, etc. However, the outlook for individual 
learning tests which could be applied in the educational clinic does not appear 
very promising, both because these would have to be very lengthy to allow 
adequate practice, and because a child's reaction to the learning situation in the 
clinic setting might differ considerably from his reaction in the classroom.


CRITICAL COMMENTS


TEACHING EDUCATIONAL PSYCHOLOGY IN TRAINING COLLEGES* 


The Report of a Joint Working Party of the British Psychological Society and 
the Association of Teachers in Colleges and Departments of Education set up in
1959 to study the teaching of psychology to training college students and to make 
recommendations. 

I


By T. H. B. HOLLINS 
(James Graham College, Leeds.) 


The Committee of Psychologists of the B.P.S. and the A.T.C.D.E. has produced 
a valuable report. It consists mainly of an analysis of the results of a questionnaire, 
on the teaching of psychology, sent to colleges in January, 1960, and recommenda- 
tions for the future. The tone is critical without being damning and the suggestions 
constructive ; they are not too idealistic to be practicable. The report is short and 
cogent ; it is remarkably free from the hazy, high-sounding generalizations which 
afflict so much writing in Education, particularly from committees. 

The picture presented here of the teaching of educational psychology in the 
colleges, at the end of the two-year course, is an unfavourable one. First, the pro- 
portion of unqualified lecturers teaching psychology is too big—39 per cent. (I shall 
speak in the present tense, as it is doubtful if the position has changed markedly 
since 1960) : and the position is worse than this figure indicates because, of course, 
some colleges have a number of specialists, more than their ' share,’ and too many 
colleges have none at all. In case it is argued that qualifications are unimportant, 
the report shows, from an analysis of examination papers in Education, set in 1959, 
of all the A.T.O.s, that the teaching is unsatisfactory; some of it is out-of-date,
e.g., questions on McDougall's list of instincts; most of it does not take account of
recent work, e.g., there were very few questions on Piaget's contributions; and
there were too many questions which could be answered merely by describing
school experiences. In short, "there was little evidence . . . that students were
expected to possess specific knowledge in educational psychology as they might be
expected to possess it in geography or physics or English." In view of these
conclusions, it is cold comfort to find that "the allocation of time to the teaching of
educational psychology is reasonable"; it is about 10-15 per cent. of the whole
college course, and about 40 per cent. of the Education course. 

The picture would be even blacker, I think, if the analysis had gone further. 
A closer examination of the qualifications of specialists in educational psychology 
would reveal that too many of them are inadequate. Too many diploma courses 
allow students to qualify by theses describing children's behaviour on a basis of 
inadequate theory and a lack of rigorous experimental design with appropriate 
statistics. Wiseman† pointed out some of these weaknesses in 1959; he analysed
theses and papers in educational psychology and showed how few of them were 
concerned with the fundamental theory of learning and child development, and how 
strong was the bias towards descriptive studies of behaviour. Again, I am sure the 
committee would have been saddened by an analysis of the answers to the 1959 
questions, instead of the questions. In the first place, in my experience, the average 
student, answering a paper in which he has to choose four questions out of ten, 
avoids any question in which he may be forced to produce precise information. 
Secondly, he tends to give the same answer to the four he does choose, e.g., on 
treating maladjustment—affection, security and play ; on learning—the essentials 
are affection, security and play; in fact, if the rubric were to forbid the mention of
‘ play ’ few would be able to answer at all, at least, amongst students training for 
the primary school. 

* Copies of the Report are available from the offices of the B.P.S. at Tavistock House
South, Tavistock Square, W.C.1., or the A.T.C.D.E., 151, Gower Street, W.C.1., price 5s.
(post free).
† WISEMAN, S. (1959). Trends in educational psychology. Brit. J. Educ. Psychol.,
29, 128-135.

Some excuse can be found for the shortcomings of the teaching if one considers
such factors as the small amount of time available in the two-year course, the 
demands made on lecturers in most colleges, which prevented adequate study, the 
poor intellectual calibre of some of the students (and this report remarks on our 
tendency to spend a disproportionate amount of our effort on the near-failures), and 
the gaps and weaknesses in psychological knowledge. A worthwhile psychology text 
is difficult (particularly to women students, whose experience of mathematics has 
frightened them off any information presented in tables) and if students are getting 
nothing from their reading (or, worse still, learning off the text by rote) we are 
forced to increase the practical work (in itself, an essential approach to psychology 
and children) to the extent of leaving out most of the theory. Brearley,* in 1957, 
gives an outline of a practical course for ' primary ' students and she concludes, 
realistically, no doubt, “ Comparatively few students are ready to understand fully 
the tenets of the major schools of psychology, but we have greatly failed if at 
least some of them are not ready to study them when they leave." Collier,† in
1960, working with men students, reports "A difficulty is getting students to read
effectively ...; it is the quality of reading that troubles us" and finds that theory
and practice only come together in 'Study-Practice'; he makes a good case for
a variety of methods, mainly practical, in this article, and some of his ideas are
incorporated in a valuable section on methods in this report. As for psychology
itself, the report rightly warns us to make "a cautious use of psychological
theory"; but this is putting it mildly. We have some certainties in the field of
abilities, but in the field of learning, and personality structure and development, 
which are central to the teaching situation, we are among the "Two-and-Seventy
jarring Sects." In addition, nearly all the terms we wish to use in educational
psychology have built-in value judgments, which lower their scientific value; the 
report regrets that so few questions are set on ' motivation,’ but maybe the 
lecturers, after reading a few pages of Peters’ Concept of Motivation, have shown a 
wise discretion. Of course, the study of psychology can sometimes be as good an 
education as the study of literature ; both will show the complexity of life and the 
unlikelihood of finding easy solutions to problems; both might produce a desire to
solve the problems, nevertheless; but few student-teachers are likely to reach this
level of sophistication. Some lecturers, too, become tired of keeping up with what
seem merely fashions in psychology.


However, signs of discouragement are, in fact, seldom observed. The compara- 
tive neglect of many areas of educational psychology in the training colleges, up to 
1960, when the three-year course came in, was due, not to pessimism about the value 
of psychology, but to an optimistic concentration on an educational policy which 
took up such psychological contributions as reinforced the policy and neglected the 
rest. This policy was, and is, to make the schools child-centred. A training college 
educationist, especially in primary work, would see the task as persuading the
18-year-old student to change from the concept of learning she would have gained in her
recent education, i.e., close study of a subject, to a concept of individual children
learning in a controlled but permissive environment where social and emotional 
learning are as important as intellectual. 


In this atmosphere, gentle but insistent pressure is put on both student and 
child to adjust to the norm of the ' happy individual ’ which seems to be the ideal of 
the clinical psychologist. McDougall's theories seem to give psychological respect- 
ability to this aim—all the instincts and emotions working together, in the end, 
under the happy jurisdiction of the master-sentiment. Freud will not do; sex and
aggression seem always about to overcome reason and civilization ; or, if repression 
is complete, neurosis seems inevitable; and there is a distasteful pessimism in his 
dictum ' The function of education is to inhibit, suppress and forbid.' I am laughing 


* BREARLEY, M. (1957). The students' understanding of children. Educ. for Teach.,
No. 42, 13-18.

† COLLIER, K. G. (1960). The teaching of psychology in training colleges. Brit. J.
Educ. Psychol., 30, 103-108.

only at the excesses of the movement and deploring only what sometimes amounts
to indoctrination, because, of course, one must have admiration for a policy which 
has helped to produce our fine infant schools and which, even if it has not yet caught 
up fully with Piaget, gives our students a practical understanding of such important 
psychological concepts as individual differences, reading-readiness and ‘ learning 
through activity and experience'; and which produces a flow of colourful and
interesting books for teaching reading and number, with efficient methods for using 
them. 

What one wants, of course, is to retain the benefits of this crusading educational 
outlook and to add to it the benefits which come from the more dispassionate 
scientific outlook which takes a cool look at the evidence and believes in experiment. 
It is here that a deeper, more searching, study of psychology in the training colleges, 
led by a psychologist properly trained in experimental procedures, would be of 
value. Luckily, conditions for this to come about are more favourable than in 1960. 
The introduction of the three-year course will give students more time to mature and 
to gain a better understanding of psychology. Even more important, there is a
determined effort, in the institutes and colleges, to ensure that members of staff are 
allowed more time for reading and research. The institutes have, for many years now, 
provided advanced teaching in psychology, but now many more lecturers have the
time and opportunity to attend them. Then, the expansion of the colleges has meant 
that the large college is now the rule, rather than the exception, and this means that 
the appointment of a qualified psychologist in nearly every education department is 
a possibility. Further, as the report points out in the chapter on "The Standard of
Students," the colleges "continue to attract an appreciable proportion of students
who have high natural endowments" and we know that, owing to the inability of
the universities to expand quickly enough, we are likely to get an increasing propor- 
tion of highly intelligent students, although the range of ability is likely to remain a 
wide one. It is against this background that the recommendations of the report
appear particularly relevant and practicable. All of them are important, but the 
following seem to me the four which would contribute most to a rapid development 
of the sound teaching of psychology and to the growth of a scientific outlook amongst 
the better teachers : 

(1) It is desirable that at least one member of the staff should have a degree in
psychology or its equivalent in a Bachelor's, Master's or Ph.D. degree in
Education, in which the main content of the course is in the field of educational
psychology.

(2) ...the teaching in educational psychology, being a co-operative task in most
colleges, requires that members of staff concerned with the subject should work 
as a team and the fullest use should be made of the specialist knowledge of the 
lecturer best qualified in educational psychology. 

(3) More use should be made of the facilities which exist for obtaining further 
qualifications through full-time and part-time study. (The facilities are
summarized in Appendix 6.)

(4) We press strongly for the inclusion of more advanced optional courses in
educational psychology for the abler students.

It is particularly desirable that advanced courses in psychology should be made
available to the more gifted students. Although the chief reasons for this are mainly
of the idealistic kind, already mentioned, it should not be overlooked that the
training colleges are hoping to have their certificate recognized in some way by the 
universities and even to have their better students working on degree-standard 
courses in Education. This would link up with the growing demand to have 
Education, and particularly educational psychology, as a first-degree subject in 
universities. 

Of course, no startling advances can be looked for immediately. The chief 
obstacles to the development of psychology as a science lie outside the ranks of 
psychologists and training-college lecturers. Some of these are—the indifference, 
or even contempt, of universities, public schools and grammar schools towards 
psychology, which means it will continue to be starved of its proper share of the 
top-level of ability, and of funds for research, for some years yet; the large classes
in schools, which force the teacher to be unduly concerned with class management
and keep him suspicious of a science which thrives on the variations amongst 
individuals; and the unenlightened teaching of mathematics and the physical 
sciences in schools, particularly to girls, which results in the inability of most 
students to study psychology beyond the anecdotal stage. Nevertheless, this report 
makes useful suggestions for another step forward.


II 


By J. M. THYNE 
(Jordanhill College of Education, Glasgow.) 


The Scottish colleges of education were not involved in the ATCDE/BPS 
inquiry, but the Report is, of course, of professional interest to them, and to their 
psychology departments in particular. I comment, however, as the representative
of only one, albeit the largest, of them—with a present full-time student population 
of over 2,500. 

With the first part of the Report, most of my colleagues in the other colleges 
would be, I think, in general agreement. Child development (social, emotional, 
physical, intellectual, moral); the nature and conditions of learning as they relate 
to habits, skills, remembering, understanding, and transfer; the psychology of the 
basic school subjects; intelligence and its measurement, and applications to 
educational issues ; these have long been major parts of the psychology syllabus for 
women on the three-year course. (Our departments of psychology are concerned 
also with graduate students, with students holding diplomas in art, music, domestic 
science and other technical subjects ; but women on the three-year course form the 
largest group of students—over 1,600 at Jordanhill, this year—and it is to them that 
my comments here refer.) 

What to include in the psychology course is, nevertheless, a major problem, 
not because it is difficult to find material relevant to the training of teachers, for it 
is not, but because of the difficulty of deciding what must be left out. The relevant 
material has to be arranged in a hierarchy of relevance, and a line has to be drawn 
in accord with the time available, the claims of other subjects, the abilities of 
the students, their own psychological maturity, and other things besides. In 
contrast with the Report, the line I draw excludes the training of students to apply 
tests, but it does include an attempt to cultivate in the student an understanding of
the history, structure, and function of tests, and an ability to interpret in educational 
terms the results of tests which will be applied to their pupils by persons with greater 
expertise. This example raises the question of the criteria to be applied in selecting 
psychological topics, a point raised again in my concluding paragraph.

I am in sympathy with the caution expressed in the same section of the Report,
though I have some reservations. As the Report says, argument by analogy from 
animal learning, and from highly controlled experiments on human learning, is of 
course, dangerous. But I, and many of my colleagues, would regard a deliberate 
attempt to minimise this danger as one of the fundamental tasks of the psychologist
who undertakes to teach his subject to teachers. It is his special responsibility 
(not necessarily shared by his fellow-psychologists in universities) to ensure that 
what he says about learning (for example) is, in fact, valid for the kind of situation
which will confront his students. And he will try to do this, not by ignoring the
material in question, but by restructuring it, and testing in the school situation the 
results of his restructuring it. For example, analysis of human habit-forming 
indicates that its conditions of occurrence (if not its content) are basically the same 
as those made explicit in classical and instrumental conditioning. Also, it is not 
difficult to show that certain psycho-analytic mechanisms, such as Projection, 
occur within the lecture-room and the classroom. Caution is not enough ; it must 
lead to constructive thinking. Admittedly, the touchstone of practicality serves not 
only to give reality to what is taught but also to reveal the sterility of what one might 
otherwise have taught. It is also a humiliating revealer of one's own ignorance. 


I am in complete agreement with the Report’s emphasis on the need to dis- 
tinguish between educational psychology and educational philosophy, but I have 
learned to be pessimistic about the possibility of making that distinction clear to all 
students. Nearly all the Scottish colleges have a department of psychology and a 
separate department of education (an arrangement of which I heartily approve), but 
this physical manifestation of segregation does not always make apparent to students 
what the difference in function is. To take but one rather blunt example: in many 
education seminars students are encouraged (very properly, I believe) to express 
their own opinions and develop their own judgements; whereas in a psychology 
seminar they may have to learn that the opinion of a student, as contrasted with his 
knowledge, is of little or no value. What matters here are the facts, and the hypo- 
theses validated by reference to them. They may have to learn also that the 
observation of two or three children does not provide an adequate basis for the 
formulation of general hypotheses about child development. (Such observation 
can, of course, be very useful for other purposes.)

On the central part of the Report (the actual content and forms of psychology 
courses) it is difficult to comment. Lists of topics, even when accompanied by tables
of hours, do not answer the questions that really matter. What topics are expounded 
in detail and not merely mentioned in passing, or in a somewhat general way ? And 
just what details are emphasised ? How much of what appears on the syllabus, or is 
referred to in lectures, seminars, and discussions, is the student expected to know and
remember? Is the material up-to-date? Even the analysis and classification of 
examination questions (which must have been a formidable task) reveals remarkably 
little about the above issues, though some of the general questions quoted in
Appendix 4 seem to imply that some parts of the courses must themselves have 
contained little detail. I am tempted to suggest that more precise information might
have been obtained from an analysis not only of the examination questions, but also 
of the students’ answers. Good psychologists could scarcely object to a procedure
widely used within their own discipline. 

Probably the major difference between the Scottish colleges and the colleges 
involved in the Report relates to the qualifications of the teachers of psychology. 
All lecturers in psychology in the Scottish colleges of education have an honours 
degree of which psychology is a main subject. All are trained teachers, with experience
of teaching. Some have worked in child guidance clinics. In such a context
I am bound to comment that the recommendations of the Report in this connection
are unduly mild, though I do appreciate many of the practical difficulties. 

The problem of suitable textbooks is serious, though slowly improving. So 
many of the texts which are sound on psychology have little or no bearing on the 
work of the teacher, and so many of those which are realistic about the classroom 
situation contain very little psychology—as psychologists nowadays would use that 
word. My own experience suggests that the ‘general’ textbook of educational
psychology may have to disappear from training colleges. When reasonably compact
texts on specific topics (such as Memory, Learning, Child Development, Intelligence) 
are available, they are often much more useful than the thinly spread omnibus 
volume. The ‘specific’ texts allow not only of relevant selection but also of
specialisation.

I have not attempted to pass comments on all parts of the Report, for such an 
attempt would occupy more space than it would deserve. I think that the type of
investigation described in the Report is valuable, and the committee responsible 
for it merits our thanks. Now, without in any way detracting from the value of that 
Report, may I suggest that a further task might profitably be undertaken, namely 
to discover the major problems actually faced by lecturers in psychology, and the 
ways in which they have been tackled? For example, what criteria are cited in 
selecting psychological material for a course? And why? Is there an optimum order
of presentation? How do we solve the problem of breadth versus depth? But it 
would be part of such an investigation to discover what questions should be asked.


THE RELATION OF NEUROTICISM AND EXTRAVERSION TO
INTELLIGENCE AND EDUCATIONAL ATTAINMENT 


A brief comment on the controversy between J. B. Biggs and R. Lynn
By H. J. EYSENCK


In the controversy between Lynn and Biggs, the crucial part appears to be
played by the exact status of ‘anxiety.’ Biggs assumes that ‘anxiety’ and
‘neuroticism’ are used interchangeably by me, but this is incorrect. Neuroticism
is regarded as an inherited psycho-physical disposition, closely linked with the
lability of the autonomic system, which governs a person's emotional reactivity,
and may predispose him to the development of neurotic disorders under suitable
circumstances. Anxiety is a conditioned fear reaction which is particularly characteristic
of dysthymic neurotics, i.e., of persons who are high on the factor of neuroticism
and also on the factor of introversion, which is significantly correlated with
conditionability. Anxiety, therefore, is a ‘mixed’ concept, being related both to
neuroticism and to introversion. (The position is rendered rather confused by
Cattell's use of the terms ‘neuroticism’ and ‘anxiety,’ which is exactly the opposite
of mine.)


My theory, which links introversion and conditionability, is not necessarily
antagonistic to that of Spence, which links neuroticism, regarded as a drive, to 
conditionability. On the theoretical level either theory, or both, or neither, might be 
right. My belief that Spence's theory is inadequate is based on the experimental 
evidence, and may very well be mistaken; the experimentally demonstrated 
correlation between the Manifest Anxiety Scale and tests of introversion makes all
the Spence-Taylor experiments using the M.A.S. equivocal. In our own work we 
have never succeeded in finding correlations between neuroticism and condition- 
ability, and neurotics as a group are more conditionable than normals only when 
they are selected in such a way that introverts are over-represented.

I agree with Biggs that a great deal more work is needed at the theoretical 
level, but I feel even more strongly that what is needed is a greater concentration 
on experimental studies, with particular reference to parametric investigations. The 
frequently contradictory results obtained by investigators testing both Spence’s 
hypothesis and my own suggest that neither hypothesis is universally valid, but
requires to be closely circumscribed with respect to its validity by detailed specifica- 
tion by such parameters as strength of CS and UCS, CS-UCS interval, proportion of 
reinforced trials and so on. Lynn's study appears to me to be a logical deduction
from principles which appear to have at least some experimental support, although 
this support may be rather circumscribed in the manner just outlined, and although 
it may, of course, be possible to account for his results in other terms than those 
premised in his theory. This hardly detracts from the value of his
demonstration.


Hunt, J. McV. (1961). Intelligence and Experience. New York: Ronald Press,
pp. vii+416, no price given.


By P. E. VERNON. 


This is an important book on at least two counts. First, Professor Hunt puts 
forward the strongest case yet made for discarding the conception of intelligence as
inborn potential which is predetermined by the genes and which matures regardless 
of environmental conditions. Secondly, he provides the most complete exposition 
available in English of the whole corpus of Piaget’s theories and experiments on 
mental development, which lead him to conclude that intelligence should be 
regarded as “central processes comprising strategies for processing information that
develop in the course of the child’s interactions with his environment.” 


He argues that predeterminism has been implicit in developmental psychology 
generally, for example in the conventional distinction between maturation and 
learning, and in the advice given to parents by mental hygienists not to ‘push’ their
children’s intellectual growth. So strong was its influence till recently that the 
common reaction to experimental demonstrations of the modifiability of measured 
intelligence (e.g., those of the Iowa school in the 1930’s) was one of derision and 
emotional controversy. But with the appearance of Hebb’s work on the build-up of 
autonomous central processes and the effects of varied environmental stimulation, 
Piaget’s descriptions of the mechanisms of accommodation and assimilation in the 
progressive developments of schemata, Harlow’s experiments on the acquisition of 
learning sets, and many other lines of evidence, the whole theoretical climate has 
changed. Besides describing this work in detail, the book is quite fair in recognizing
contrary evidence, such as Gesell's classical co-twin control experiments on matura- 
tion, and shows how these can be reconciled with the newer theories. 


Hunt does not deny that genetic factors set a limit to the complexity of what 
can be learned, but his interpretation of the work of modern geneticists leads him to 
deny that we can ever isolate and measure the genetic component either in physical 
or mental development. Attempts to apportion hereditary versus environmental 
variance (e.g., the often quoted 80 per cent.—20 per cent.) are misleading, since at 
best they only reflect the environmental pressures that have operated in the sample 
being studied, and tell us nothing about what would happen under different environmental
conditions. Many investigations of more extreme environmental deprivations
indicate far greater effects, and he goes so far as to predict that the application of a 
sound psychology of development could raise the average intelligence by an amount 
equivalent to 30 points of I.Q. In the reviewer's opinion, this forecast is unnecessarily
rash ; we know far more about the conditions that inhibit than those that
favour the full realization of intellectual potential. And, although Piaget's work has 
thrown a flood of light on cognitive processes, it does little or nothing to improve 
our control of the even more important motivational ones. 


The main key to such improvement, he claims, is the matching of environment
to the current level of the child's schemata ; (he admits that this principle is already
realized by the good school teacher). If the child is set problems which merely
exercise schemata that are already organized, he gets bored ; while, if they are too 
far ahead of him, he is uninterested or frustrated ; whereas with appropriate choice 
of stimuli, there should be positive pleasure in learning just as we find in the baby's
acquisition of talking and walking. Circumstances which are appropriate at one 
stage may inhibit at another; for example, the highly protective mother may 
provide adequate experience in the sensorimotor stage, yet deprive her child of it in 
the preconceptual (1½-4 year) stage. Variation of circumstances is another important
principle in the development of flexible and transferable skills.

These educational implications are among the most original parts of the book.
For the most part it covers ground which is already fairly familiar to educational
psychologists in Britain (though perhaps less widely known in America). As is only 
to be expected from a polemical treatise, the author's enthusiasm sometimes runs 
away with him. For example, it would have been helpful if he had given some
consideration to the strong counter arguments that have been put forward by 
Professor Burt in a series of recent articles in the General and Statistical Journals 
of the B.Ps.S. Again, he probably gives too much credence to Piaget and Inhelder's
claim that each stage develops out of preceding ones (e.g., that realization of 
constancy of quantity always precedes constancy of weight, which always precedes 
constancy of volume); and that a wide range of restructurings tend to occur 
simultaneously (e.g., that the attainment of formal thinking about scientific prob- 
lems is accompanied by the revolutionary attitudes about society characteristic of 
adolescence). The more carefully controlled studies of Peel and Lovell, for example, 
indicate much more variation and inconsistency than Piaget usually admits. 
However, there can be no doubt that this will be an influential source book for many
years, particularly since, although fairly technical in parts, it is a good deal easier
to follow than Hebb's or Piaget's own writings. It is strongly recommended to 
advanced students and to lecturers in education and psychology. 


THE APPRECIATION OF BEAUTY 


VALENTINE, C. W. (1962). The Experimental Psychology of Beauty. London: 
Methuen, pp. xiv+438, 45s. net. 


By CYRIL BURT


Professor Valentine's aim has been to present a systematic survey of experi- 
ments on the appreciation of art, music, and poetry, and to determine how far the 
psychological differences between individuals help to account for the wide differences
in what is known as ‘artistic taste.’ Those who have acted as subjects in
these various researches have frequently been children of school age. Hence, quite 
apart from the intrinsic interest of the whole problem, the observations recorded and 
the conclusions drawn have a direct bearing on the work of teachers. 

He begins by observing that philosophers, art critics, and aestheticians have 
so far failed to agree on what is meant by the word ‘beauty’, and he himself 
declines to offer any formal definition. Many, indeed, would hold with G. E. Moore
that, strictly speaking, the word ‘beauty,’ like the word ‘good,’ is indefinable :
the most we can do is to recognize it when we see it, and point to typical instances 
which may serve to indicate what precisely we have in mind. In ordinary life nearly 
all of us distinguish certain conscious experiences as having a common quality which 
we indicate by the term ‘beautiful,’ although in other respects they may vary widely: 
they may be visual, auditory, verbal, emotional, or even purely intellectual. Beauty 
itself seems a special form or aspect of what is technically termed ‘value’, a word
which is interpreted either objectively (as the older writers usually did) by saying
that it denotes “something that is worthy to exist,” or subjectively, by saying it is
“what we prefer, like, or admire.” But once again, these are merely rough, ostensive
descriptions. Value itself is a ‘primitive concept,’ and as such cannot be defined.
Yet that need not prevent scientific study. Vague as they are, the descriptions 
given are sufficient to show what sort of experiences we propose to study, what types 
of material we must therefore select, and the kind of questions we need to ask. 

The characterization to which, it would seem, the experiments themselves 
eventually lead (so Professor Valentine tells us in his final chapter) is that the
experience of beauty is distinguished by a “delighted absorption in the object
itself.” One thing becomes increasingly clear. “Art,” as Herbert Read has
emphasized, “is not necessarily beauty: that cannot be said too often or too
blatantly.” Nor is there any unique type of feeling that we can label the aesthetic
emotion. The experience seems rather to be a perception or perceptual judgment
of some particular quality, or combination of qualities, which is as a rule accompanied
by “ a harmonious blend of feelings and a certain attitude of mind.” The unexpected 
complexity of the whole process and the astonishing number of factors that affect it 
are perhaps the most instructive conclusions of the entire series of investigations 
that Professor Valentine describes. 

His book is divided into three parts—the first dealing with visual experiences, 
the second with auditory, and the third with verbal. In each he begins with the
simplest types of stimuli, and advances progressively to the more complicated. 
Accordingly, after a brief introduction, his first chapter starts with experiments on 
simple colours, and reports the results of numerous studies carried out by himself and 
others on colour-preferences among infants, school children, and adults from various 
races and countries. So far as comparable data can be obtained, it appears that 
during early childhood vivid colours are everywhere strongly attractive, and that 
red, orange, and yellow are generally preferred to green, blue, or violet. Later, 
green or blue tend to predominate, and white rises high in order of preference. 
Here at the very outset we encounter marked individual differences, and detect the 
operation of a variety of factors, similar to those we meet with in experiments on 
more complex objects. Thus, in one set of investigations it was found that children 
as they grow older become more and more influenced by personal association,
particularly in the case of colours that are unsaturated or ‘mixed’ rather than
saturated and ‘pure’ : one boy, for example, says he likes a pale faded yellow-green,
“because it makes him think of leaves in Autumn” ; another dislikes red, “because
it reminds him of blood.” These illustrate the commonest or ‘associative type.’
Other children refer to the ‘subjective’ feelings the sight of colours arouses: “this
yellow dazzles the eye,” “that bright red makes you feel hot all over,” “the green
is so restful and soothing.” Somewhat rarer is the so-called ‘character type’:
one girl, for instance, speaks of scarlet as “loud and aggressive” ; another of pink
as “rather demure.” And finally, a few among the oldest adopt an entirely ‘objective’
attitude, and give reasons relating solely to the colours themselves : they like
an ultramarine “because it is such a pure blue,” and dislike a certain green “because
it has too much yellow in it to be a really good green.” As Professor Valentine points
out, these four types, with varying frequencies, crop up in every medium and at 
almost every stage. Nevertheless, so he maintains, there is sufficient uniformity to 
suggest, in addition to the effects of experience or temperament, the influence of an 
innate differential response to colours of different hues, changing spontaneously 
as the child matures. From single colours we turn to colours in combination ; 
and at once we find that the mere presence of another hue can greatly modify the 
initial preferences. However, the appreciation of colour-harmony and the distaste 
for colour-clash does not appear until relatively late—six or seven years, according to 
most observers, though with a few individuals it emerges as early as four. 

From colour we proceed to form. There is an unexpected measure of agreement 
over preferences for certain kinds of lines, curves, and linear shapes. Here the 
influence of Einfühlung or ‘empathy’ is particularly striking: the spectator, so to
speak, sympathetically puts himself in the place of the lines and shapes; he feels 
himself moving along the graceful curve or supporting the heavy horizontal oblong. 
‘Subjective types’ and ‘character types’ are thus strongly in evidence. Certain
zigzags are described as ‘lively’ or ‘playful,’ others as ‘harsh’ or ‘cruel.’ Rising
lines seem ‘active’ or ‘hopeful’; falling lines ‘weary’ or ‘sad.’ And the experiments
sufficiently prove, if proof were needed, that there is no single line or curve
which is unanimously preferred. The recurrent search by theorists like Lomazzo, 
Hogarth, and Winckelmann for a so-called ‘line of beauty’ ended, we may remember,
in the serpentine curve of a question-mark. 

From the canons of Polyclitus and Vitruvius onward to the present day, 
numerous writers have maintained that, as in music and poetry, so in art and 
architecture, the secret of beauty lies in the component proportions, and that these
proportions could be numerically expressed. The doctrine is most succinctly summed
up in a dictum of Palladio (I Quattro Libri dell’Architettura, 1570) : “the beauty of
a structure consists in the exact proportions within the parts themselves and of
each part within the whole.” It is, therefore, not surprising that the earliest experiments
on the appreciation of the beautiful (those carried out nearly a century ago
by Fechner, “the father of both psychophysics and experimental aesthetics”)
were concerned primarily with the problem of the most pleasing proportion. He
began with the simplest cases: the cross and the rectangle.

Many teachers will remember puzzling over Definition 3 in Book VI of Euclid’s
Elements: “A straight line is said to be cut in extreme and mean ratio when the
whole is to the greater segment as the greater segment is to the less” : i.e., if a
and b are the two segments, a/b = b/(a+b) ; (the solution is b = (√5+1)a/2 ; for
the construction, see Euclid VI, 30 and II, 6). This ratio defines what artists of a
later age called the ‘Golden Section.’ The proportions of a common filing card,
3×5 in., are a rough approximation; 5/(3+5) slightly nearer; and 8/(8+5), the
dimensions of a foolscap sheet of typewriting paper, nearer still. This sequence
(1/2, 2/3, 3/5, 5/8, 8/13, . . . , based on the so-called ‘Fibonacci numbers’) progressively
approximates the golden section, the proportion adopted again and again,
sometimes consciously, sometimes unconsciously, by practising artists, architects,
and sculptors. The rival theory maintained that the ratios for visible beauty should
be the same as for audible beauty, namely, those which determined the most
harmonious intervals in music: 1/2, 2/3, 3/4, 5/6 and 3/5. Both theories held that
the simpler ratios served for the simpler structures, and that the more complex
could be appreciated only by the relatively sophisticated spectator or listener.
Fechner claimed that his experimental results showed plainly that the formula for
the golden section furnished by far the most pleasing proportion, and gave “the
right combination of diversity in unity.”
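
The arithmetic of the preceding paragraph can be checked with a short sketch. It is offered only as an illustration, not as anything taken from the book under review: the function names are hypothetical, and the golden section is taken, as in the text, to be the ratio of the lesser segment a to the greater segment b, roughly 0.618.

```python
import math

# Golden section as used in the text: the ratio a/b of the lesser to the
# greater segment, where a/b = b/(a+b) and b = (sqrt(5) + 1) * a / 2.
GOLDEN_SECTION = 2 / (math.sqrt(5) + 1)  # roughly 0.618


def fibonacci_ratios(count):
    """Return (smaller, larger, smaller/larger) for the first `count`
    pairs of consecutive Fibonacci numbers, starting from 1/2."""
    ratios = []
    smaller, larger = 1, 2
    for _ in range(count):
        ratios.append((smaller, larger, smaller / larger))
        smaller, larger = larger, smaller + larger
    return ratios


def satisfies_extreme_and_mean_ratio(a, tol=1e-12):
    """Check that b = (sqrt(5) + 1) * a / 2 gives a/b = b/(a+b)."""
    b = (math.sqrt(5) + 1) * a / 2
    return abs(a / b - b / (a + b)) < tol


if __name__ == "__main__":
    for smaller, larger, ratio in fibonacci_ratios(6):
        gap = abs(ratio - GOLDEN_SECTION)
        print(f"{smaller}/{larger} = {ratio:.4f}  (distance from golden section {gap:.4f})")
```

Run as a script, it lists the ratios 1/2, 2/3, 3/5, 5/8, 8/13 and 13/21 together with their distance from the golden section, so that the filing card (3/5) and the foolscap sheet (8/13) can be compared directly.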

Professor Valentine reviews the numerous researches since carried out with 
subjects of different ages to verify Fechner's conclusions. At first sight the data
obtained seem hopelessly at variance. Nevertheless, the main trend is roughly in 
keeping with a generalization suggested by one recent investigator, Miss Donaldson. 
She found that up to the age of about 5, children exhibit no definite preference ; 
then they tend to favour the simplest ratio of all, 1/2, i.e., an oblong which is a
double square ; later, they prefer a ratio approaching 2/3; later still 3/5; and so
on, until at about the age of 12 the golden section becomes the most popular (except 
when the oblong suggests some specific object such as a window or door); after 
that, even this rather subtle relation begins to appear flat and insipid, and more 
daring proportions, varying widely with different individuals, come to be preferred, 
with the broader ratios (3/4 or 4/5) predominating. This tallies with the verdict of 
Bacon. “There is,” he writes, “no excellent beauty that hath not some strangeness
in the proportions : a painter may make a better face than ever was, but he must do
it by a kind of felicity, and not by rule.” And certainly one can hardly suppose that
any strict mathematical sequence governs the development of the child's preferences 
or that any exact numerical ratio determines our maturer appreciation of beauty. 

R. M. Ogden, it is true, professes to find golden sections, pentagons, ellipses, 
and circles, nearly always underlying the idealized human figure as represented in 
the paintings of Renaissance artists and the statues of ancient Greece. But, as 
Professor Valentine rightly comments, if you draw a sufficient number of geometrical 
shapes over your picture, some of them are bound to provide an approximate fit.
Miss Puffer's experimental results, which he discusses in greater detail, appear to
come nearer the truth. The proportions involved are not just geometrical ratios in 
space: they express, as it were, a balance of forces. Instead of the simple symmetry
that appeals to the younger child as he arranges objects on the mantelpiece, the 
experienced eye prefers “that subtler type of composition, which symbolizes some
sort of equilibrium”, particularly, I would suggest, the homoeostatic principle
so characteristic of the living form.

This, I take it, is why the nude has played such a vital part in art and art- 
training. The ‘Venus coelestis’ of the Greeks (the Cnidian Aphrodite, for example),
the ‘Venus naturalis’ of the Gothic convention (as exemplified in the ‘Eves’ of
Memlinc or Van Eyck), the sculptured pharaohs of the 18th Dynasty, the David of
Michael Angelo, and the beardless and undraped Christ who pronounces sentence in
the Last Judgement: all are based on proportional schemes ; and yet their several
proportions differ as widely from each other as they do from those of the male or
female model in the life-class. Hardly any one in my experience, whether educated,
illiterate, or avant-garde, ranks even the most artistic photographs one can find
either of naked athletes or the winners of the latest beauty contest as more pleasing 
aesthetically than the relatively unrealistic figures of antiquity or the high renaiss- 
ance; and, what is still more significant, even when the inexpert spectator fails to 
distinguish the photograph of a statue or painting from the photograph of a living 
model, the former occasions none of the embarrassment so often caused by the 
latter. Each work of art is, as we say, ‘idealized’, in the sense that the shapes and
planes, the anatomy and the general poise, are so simplified as to help the spectator 
to apprehend in a glance something of the tense play of forces that underlies the 
most complex and highly developed forms of life. For the picture or the statue 
itself must have something of the characteristics of a living and organic whole. 

In the whole field of experimental aesthetics by far the most popular subject 
for research has been the appreciation of pictures. One result, on which all investiga- 
tors are agreed, is the way the content of the picture—the story that it tells or the 
subject which it depicts—governs the preferences of younger children and less 
cultured adults. For the rest, one's first impression is that so many different factors
influence each person’s artistic judgment, even among the expert and the con- 
noisseur, that any attempt to extract some underlying principle is completely 
swamped by the overwhelming fact of individual differences. De gustibus non
disputandum. However, when a systematic statistical analysis is undertaken, the 
proverbial maxim quickly proves to be grossly misleading. If we collect a set of 
(say) fifty pictures ranging from good reproductions of acknowledged masterpieces,
second and third-rate specimens of every type and variety, down to the crudest 
birthday card to be found in the local paper-shop, and then compare the order of 
preferences obtained from a representative sample of people, including both the 
cultured and the uncultured, the philistine and ‘the fellow from the bus,’ as well as
the professional artist and art critic, we find that almost all the correlations between
their rankings are positive, and the average coefficient surprisingly high. To borrow
the technical jargon of the mental tester, there is a well marked ‘general factor’
accounting for about 50 per cent of the ‘variance.’ That, of course, still leaves
ample room for wide individual differences. Subordinate factors indicate the
presence of supplementary tendencies or types—some persons, for example, being
guided mainly by colour, others by form; and to a large extent, their type of pre- 
ference tends to be correlated, not only with their past experience and training, 
but also with their temperamental and emotional outlook. 
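
The kind of analysis described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the judges and pictures are simulated, the function names are hypothetical, and the share of variance is taken from the first principal component of the correlation matrix, which is only one of several ways of estimating a general factor and is not necessarily the procedure used in the studies under review.

```python
import random


def spearman(ranks_a, ranks_b):
    """Spearman rank correlation for two tie-free rankings of the same items."""
    n = len(ranks_a)
    d2 = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - 6 * d2 / (n * (n * n - 1))


def simulate_rankings(n_judges, n_pictures, noise, seed=0):
    """Hypothetical judges: each ranks pictures by shared merit plus private taste."""
    rng = random.Random(seed)
    merit = [rng.gauss(0, 1) for _ in range(n_pictures)]
    rankings = []
    for _ in range(n_judges):
        scores = [m + rng.gauss(0, noise) for m in merit]
        best_first = sorted(range(n_pictures), key=lambda i: -scores[i])
        ranks = [0] * n_pictures
        for position, picture in enumerate(best_first, start=1):
            ranks[picture] = position
        rankings.append(ranks)
    return rankings


def correlation_matrix(rankings):
    """Correlate every judge's ranking with every other judge's ranking."""
    return [[spearman(a, b) for b in rankings] for a in rankings]


def mean_off_diagonal(matrix):
    """Average correlation between distinct judges."""
    n = len(matrix)
    total = sum(matrix[i][j] for i in range(n) for j in range(n) if i != j)
    return total / (n * (n - 1))


def general_factor_share(matrix, iterations=500):
    """Largest eigenvalue (by power iteration) divided by the number of judges."""
    n = len(matrix)
    vector = [1.0] * n
    eigenvalue = 0.0
    for _ in range(iterations):
        product = [sum(matrix[i][j] * vector[j] for j in range(n)) for i in range(n)]
        eigenvalue = max(abs(x) for x in product)
        vector = [x / eigenvalue for x in product]
    return eigenvalue / n


# Simulated data only: 12 imaginary judges ranking 50 imaginary pictures.
rankings = simulate_rankings(n_judges=12, n_pictures=50, noise=1.0)
matrix = correlation_matrix(rankings)
print("average correlation between judges:", round(mean_off_diagonal(matrix), 2))
print("share of variance on the general factor:", round(general_factor_share(matrix), 2))
```

Raising the `noise` argument makes the simulated judges more idiosyncratic, which lowers both the average correlation and the share carried by the general factor; lowering it has the opposite effect.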

Professor Valentine concludes this section with a brief but instructive chapter 
recording experiments carried out by himself and his co-workers on ‘reactions to
modern art.’ Here, the individual differences were even more conspicuous. In
general, among the ‘unsophisticated students,’ who formed the bulk of his subjects,
conservative or conventional judgments were commonest. Nevertheless, an un-
expectedly large number described the pictures as ‘interesting,’ either (so it would
seem) because of their novelty or their daring or because of the bewildered curiosity 
they so often excite. Those who like them most are often high-spirited youngsters 
who find the stock types of beauty dull, effeminate, or devoid of interest, and enjoy 
having their tougher emotions stimulated and admire those who have the courage 
to violate the traditional rules. “At times,” as Eric Newton observes, “it is
refreshing to smash the china and break bottles, or watch others at the game.”
Occasionally, one even wonders whether the modernist does not deliberately play
up to this attitude: indeed, Professor Valentine himself quotes a confession
attributed to Picasso to the effect that he had, in his later years, intentionally
“fed these fellows with what they wanted, and become rich and celebrated through
amusing myself with these farces.”

The appreciation of music is dealt with along similar lines, but rather more
briefly. The experiments begin not with single notes (e.g., those played on instru-
ments of different timbre), but with simple combinations of notes, i.e., with a com-
parison of intervals. Professor Valentine's own experiments on this and kindred
problems—first with school children and then with students—were started more than
fifty years ago. He found much the same types of attitude here as in dealing with 
colours. Moreover, contrary to the assumptions commonly made, the consonant 
intervals were by no means always the most pleasing, nor the dissonant the most 
unpleasant. At the present day, the average order for most persons appears to be— 
major 3rd, minor 3rd, major 6th, minor 6th, octave, perfect 4th, perfect 5th, 
augmented 4th (tritone), major 2nd, minor 7th, major 7th, minor 2nd. This, it
will be seen, corresponds neither with the order suggested by the ratios nor with 
that suggested by the relative absence of beats. The ages at which children can 
distinguish concords and discords, as well as their actual preferences, are greatly 
influenced by early musical training; and during the course of a single set of 
experiments repetition and familiarity will often render a discord as agreeable as 
the concords. 


The long chapter on ‘special factors in listening to music’ is one of the most
interesting in the book. The chief problems discussed—always in the light of 
experimental evidence—are the influence of specific characteristics, like rhythm, 
melody, harmony, and timbre, on the listener's general appreciation, and the effect 
of the general characteristics of the music on his more specific responses—on his 
imagery, for example, and on physical processes such as pulse, respiration, and 
(most obvious of all) definite movement—nodding, beating time, dancing, or 
marching more rapidly or more slowly. The marked effects on speed of movement
suggest that an instructive research might be carried out on the converse influence—
that of the natural motor tendencies of a listener, composer, or conductor on the
style of performance he favours. Few conductors, for example, keep strictly to
Beethoven’s metronome markings; and their differences in regard to tempi are 
amazing. Beecham, at the Festival Hall in 1956, took barely an hour to get 
through The Ninth Symphony ; Hindemith, a couple of years later, took nearly an 
hour-and-a-quarter. Now it has been claimed by more than one musical critic that
the preferred rates of both composers and conductors are noticeably influenced by 
the length of their limbs ; the tempi of Beecham and Kleiber, for instance, are on 
an average much faster than those of Toscanini, Furtwängler, or Klemperer; and 
the numerous allegros of Bach and Berlioz, as contrasted with the ambling andantes
favoured by Schubert, have been associated with the speed at which, according to 
their biographers, each composer habitually walked. Analogous differences are 
sometimes discernible with tall and short listeners, though I know of no recorded 
correlations. This is merely an incidental example of the numerous out-of-the-way 
factors that keep cropping up as soon as one starts watching for possible influences. 
May we hope that music teachers will not only study the suggestive conclusions that 
Professor Valentine himself has drawn, but will also exploit their exceptional 
opportunities for carrying out similar researches of their own ? 


The concluding chapter in this section is concerned with music as a “vehicle
for expressing meaning and emotion.” Many composers, particularly the romantics,
have openly declared that their aim was to convey emotions, represent scenes, or
even portray characters (as in Strauss's Till Eulenspiegel or Elgar's Enigma
Variations). But the outcome of numerous experimental researches is to demon-
strate how hopelessly even the most expert composers will often fail. And, as
Professor Valentine points out, a number of investigators have accordingly 
concluded that music neither can nor should seek to communicate a visual, verbal, 
or emotional ‘programme’; many musicians would, of course, agree. But the 
criteria adopted by the investigators are, perhaps, too severe. The composer would 
probably not claim that his audience should be able to guess his intention with no 
further clue, but merely that the style of his composition should be consistent with
the mood he attempts to convey, which he commonly indicates by a title. Moreover, 
he would usually expect that the competent listener would already be familiar with 
the conventions and devices which he and his predecessors had developed for particu- 
lar purposes. 

Doubtless, there are many things that music cannot effectively communicate. 
But numerous results cited by Professor Valentine show that, in many cases, certain
emotions and moods are expressed quite successfully. For example, in one investi-
gation 93 per cent. of the subjects, given a list of alternative descriptions, agreed
that the ‘Marcia Funebre’ of the Eroica expressed grief, sadness, or despair (only
12 per cent. could identify it); 63 per cent. gave triumph or determination as the
mood expressed by the last movement of the C Minor; 72 per cent. agreed that
Nicolai’s Merry Wives of Windsor expressed amusement or joy, and so on.

When I myself have quoted these or similar examples in lectures or discussions, 
I am nearly always asked ‘Why?’ So may I briefly mention the answer I should 
give. The human voice is our supreme medium for expressing emotion; and in so 
doing it varies recognizably in level and change of pitch, in level and change of 
loudness, in level and change of tempo, and in changes of timbre. Now these are 
all distinctive qualities of music. Where no special emotion is expressed, the melody 
at its simplest tends to follow the stock pattern of speech used in making plain 
affirmations, i.e., its basic ‘intonation’ as the phonetician calls it: each ‘sentence’
rises and then falls, and the sentences themselves rise to a climax and then drop to 
the original level; emphasis can be added by special stress. The most familiar 
instances are to be found in well known hymn tunes—e.g., Tallis (‘The heavenly
child in stature grows’), Innocents (‘Gentle Jesus’), or Petra (‘Rock of Ages’);
but it is also found, with added emphasis, in popular songs (e.g., ‘We won't go home
till morning’) and in classical melodies (e.g., ‘Freude, schöner Götterfunken,’
in the Choral Symphony). It is likewise the scheme of the old psalmodic plainsong
and of ordinary recitative (musica parlante). If a command, a direct question, or a 
question with an interrogative is to be expressed, the melody tends to follow the 
rise or fall that would be used in ordinary conversation. Gaiety and excitement are
indicated by a raised pitch and a quickened tempo ; rest or sadness by a lowered 
pitch and a slow tempo. Vigour is expressed by wide intervals (e.g., the drops of a
seventh in the ‘Nimrod’ variation); weariness or anxiety by the smallest possible
intervals (e.g., the opening phrases in the overture to Tristan); and note that in
both cases a suitable difference in timbre is produced by the choice of instruments 
and the unusual harmonization. But I need not multiply the parallels. 

Of course, in highly developed music these echoes of speech-melody get overlaid 
by the distinctive characteristics of the musical tradition to which the composition 
belongs—the devices of key, scale, melody, rhythm, modulation, harmonization, 
and the formal structure, which the composer inherits or chooses to introduce. 
Music as such is an art with principles of its own ; and most musicians, I imagine, 
would hold that, even if an incidental or concomitant expression of definite emotions 
may at times form part of its aim or technique, this is not its primary purpose. Its 
function is to produce aesthetic compositions (in the literal sense of the word 
' composition ’), patterns put together out of the relevant material—in this instance 
sounds with their varying qualities of pitch, loudness, timbre, etc., and their emotional 
associations. If, like other works of art, it arouses a specific feeling, the essential
feeling (as the listener's comments sufficiently prove) is aesthetic feeling. 

Professor Valentine devotes only a few pages to “responses to modern music.”
For some unexplained reason, modern music, unlike modern painting or modern
poetry, has failed almost entirely to stir the interest of experimental psychologists.
With one exception, which he mentions in a footnote, he believes his own experi-
ments are almost the only attempts in this novel field. His chief results are, first,
that the judgments passed by the listeners are based most frequently on an ‘objective
attitude’ and comparatively seldom on ‘associations’—quite different from what is
found in the case of traditional or classical music; and secondly that, in spite of the
marked differences, “there are far more similarities in the responses” than we might
have expected. From my own experiments it would seem that young people with 
little musical experience and no musical training rarely appreciate modern music as 
music. Many describe the irregular and the primitive character of the rhythm as 
‘exciting’ or ‘stimulating’; a few feel the strange harmonies or lack of harmony
to be productive of a heightened tension: one speaks of it as ‘tormented and rather
neurotic’; several, particularly among the younger and more rebellious males,
find it exhilarating to get away from what they term the ‘sloppy’ or ‘namby-
pamby sentimentality’ of the older music—“all pretty-pretty and rather stale,”
as one of my listeners observed. Most of those who liked it were already vaguely
familiar with modernism from signature-tunes and accompaniments in the cinema 
or television programmes. The rest thought it ‘ difficult to follow ’ and ‘ tuneless.’ 
Of those who had enjoyed some musical training, a number described the composi-
tions as ‘interesting experiments,’ and many said they would understand and enjoy
them better after further hearings. The more appreciative fell into two main types : 
the larger group preferred the so-called neo-classicists (as represented, for instance, 
by the later works of Stravinsky); the smaller group preferred the atonalists 
(Schönberg and his various followers), though none used these terms or identified 
the composers. 

However, what the psychologist wants to know is this. Not one of the modern-
ists, I imagine, would accept the common type of criticism—that he is merely
experimenting, merely trying to be original or to show what ingenuity can accomp- 
lish with a twelve-tone scale, or even just striving to escape from the tedium of the 
old classical rules and the “ linked sweetness long drawn out ” of the later romantics. 
Nearly all would contend that, with their revolutionary principles—rejecting the 
stylized structure and the call for a clear melodic outline, the traditional formulae for
resolving discords, and the adherence to definite keys and definite time-signatures—
they are able to achieve very much more than any of their hide-bound predecessors.
But, we must ask, have they succeeded in producing the effects they intended ? Or 
are their intentions comprehensible and perceptible solely to those esoteric followers 
who have expressly studied the new fashions ? On the whole, I am inclined to think 
that the experimental results undoubtedly show that they have, in some measure, 
been genuinely successful. And yet, I would suggest that further investigations of 
the kind Professor Valentine describes are really needed before a psychologist can 
feel fully convinced. One result, I suspect, would quickly emerge: the approach 
adopted by nearly every theorist, from Rameau and Monteverdi to Helmholtz, 
Prout, and Schönberg—all of whom base their doctrines primarily on mathematical
and physical considerations rather than on psychological—is both mistaken and 
misleading. 

And so we come to the final section of the book, the shortest of all, which deals 
with the appreciation of poetry. Once again I find Professor Valentine’s own
experiments by far the most interesting of those that he records. And where he
disagrees with the conclusions of the literary critics—particularly those of the well- 
known studies of I. A. Richards at Cambridge—Valentine’s views always seem more 
convincing than theirs. In his work with children he finds (as we might expect)
that at the younger ages, “the story is the main thing,” and yet (what we might
not have guessed) many reveal “a surprising sensitivity to good and bad poetic
expression”: “the sense of beauty develops even before the power of clear under-
standing.” During earlier years rhythm exercises the most potent influence; the
appreciation of word-melody and vowel-music comes much later. Visual imagery 
may often enhance enjoyment, though occasionally it impairs it. He gives several
instructive instances to illustrate the child's approach. Wordsworth, for example, 
in the first edition of his ‘Sketches among the Alps’ has the line: “Under a hoary
oak's thin canopy.” Later he changed it to “Beneath an old grey oak as violets
lie.” Young children prefer the former version, “partly because, as one of them
puts it, ‘hoary is a nice poetic word’”; the older usually prefer the second,
“because it makes a pretty picture, and sounds best: the ‘hoary’ oak and the
‘canopy’ seem like using grand words for effect.”

Professor Valentine adds a further chapter on ‘experiments with modern 
poetry.’ His own appear to be the only investigations on the subject carried out in 
this country, though a few have been attempted in America. Most of his students 
commented on the difficulties presented by such poems, and there seems to be a 
correlation of 0.2 to 0.3 between the ability to comprehend them and the student's
I.Q. However, T. S. Eliot, we may remember, observes that “the seasoned reader
does not bother about understanding—at least not at first.” Contrary to common
opinion, the experimental results show plainly that “it is quite possible for people
whose favourite poets are Wordsworth, Keats, and Pope to find some modern 
poems also attractive." But the main conclusion that Professor Valentine seeks to 
emphasize is that here, perhaps more clearly than anywhere else, we find that “art
is not necessarily to be identified with beauty.” The aim of such poetry is to express,
not so much a sense of what is ordinarily considered beautiful, but rather an
“intellectual toughness in the midst of passionate feeling”; and he quotes with
approval the remarks of Goethe that “no object need be excluded as unpoetical”
and of Daiches that the merit of the modern style is “to render, in a way no other
medium can achieve, a particular apprehension of the excitement and complexity 
of experience.” 

The book ends with a chapter of conclusions, recapitulating the main results
and discussing what general inferences are common to all three fields. The first and 
most obvious conclusion, which emerges from almost every experimental study here 
reported, is the wide variety and range of the individual differences. And yet, 
underlying these differences, certain basic principles appear, especially when the 
data are analysed statistically. To begin with, in each of the three fields, there are 
large general factors commonly covering all the persons tested: thus, in spite of their
superficial peculiarities, each person tends, in a greater or lesser degree, to rank the 
items presented in much the same order of merit. Next, we find another general 
factor, almost as large, covering all the different fields: thus, on the whole and in 
the long run, a person who is a good judge in one field tends to be a good judge in the 
others. This capacity is partly, but by no means entirely, accounted for by the 
general factor of ‘intelligence.’ There seems also to be a general capacity for 
aesthetic appreciation as such. But, in addition to this general capacity, there are 
several specific factors, temperamental (so it would appear) rather than cognitive 
or intellectual—for example, those giving rise to a preference for romantic rather than 
classical works, and again for realistic rather than impressionistic techniques ; 
and each of these seems to influence appreciation in all three fields, visual, audible,
and verbal. Moreover, the rule that “attention should be at once stimulated and
facilitated” is common to all three types of artistic creation. Thus, in painting,
in music, and in poetry the principles of composition are very largely the principles 
of attention: there must be sufficient variety, complexity, and novelty to excite 
and retain the observer’s attention ; and at the same time, there must be sufficient 
unity and familiarity to prevent his attention being completely baffled. The span 
of the observer’s attention—a span which, as we know, increases with the increase 
in the child’s mental age—must itself be a factor determining and limiting the 
observer’s appreciation. And, of course, the capacities for discrimination in different 
sensory fields—visual, auditory, and verbal—are also relatively specific, developing 
or maturing at rather different ages. Each of the various capacities and tempera- 
mental characteristics revealed by these experimental results must be to a large 
degree innate. But how much is due to genetic constitution and how much to past 
experience and to cultural environment is a problem on which there have been many 
dogmatic assertions, but, up to the present, little or no scientific investigation. 

It is now just half a century since Professor Valentine published a slim manual 
with the same title as his latest volume. Since then, strange to say, no other book has
been issued dealing with the same general topic. Professor Valentine’s thus remains
unique, and is certain to take its place at once as a classic. It is a book which will
not only have a strong appeal for the interested layman; it is one which every
psychologist and every teacher should study. The increase in its size amply testifies
to the vast amount of work that has been accomplished during the intervening years. 
Yet there are numerous fields which the experimentalist has scarcely touched—the 
appreciation of prose, of architecture, of industrial and domestic design, of the 
aesthetic aspects of the theatre, the film, and the presentation of television pro-
grammes. Thus the urgent need is still for further research. This, as Professor
Valentine rightly insists, is a task in which the artistic expert, critic and creator 
alike, should co-operate with the trained experimentalist far more closely than hither- 
to. And no one, I would add, is so well situated for undertaking and assisting in 
such inquiries as the teacher in the classroom or the lecturer in the training college 
or the schools of art and music. 


AINSWORTH, Mary D., et al. (1962). Deprivation of Maternal Care. Geneva:
World Health Organisation, available through H.M.S.O., pp. 165, 12s. 


This book “attempts a reassessment of the effects of maternal deprivation in
the light of advances in the last ten years and criticisms of the theory.” It consists
of contributions from the following: D. G. Prugh and R. G. Harlow—Masked
Deprivation in Infants and Young Children; R. G. Andry—Paternal and Maternal
Roles and Delinquency; M. Mead—A Cultural Anthropologist’s Approach to Maternal
Deprivation; B. Wootton—A Social Scientist's Approach to Maternal Deprivation;
S. Lebovici—The Concept of Maternal Deprivation—A Review of Research; and 
M. D. Ainsworth—The Effects of Maternal Deprivation—A Review of Findings and 
Controversy in the Context of Research Strategy. 

Inevitably in a book of this nature there is a certain amount of duplication ; 
this is particularly evident in the reviews of research. The contributors have been 
chosen from widely different fields, however, and it is interesting to see differing 
interpretations of the same researches. Some of the contributions seem less significant 
and useful than others. Lady Wootton’s, for example, is, in her own words, “in the
main an abbreviated report of what I have said already in the relevant chapter of
‘Social Science and Social Pathology,’” and adds little. Andry's contribution is
useful in that it draws attention to the paternal role—as well as the maternal—
but a purist might quibble about a contribution to ‘Deprivation of Maternal Care,’
which devotes at least half of its length to a study of ‘the pathological role of
fathers in delinquency.’

As a whole, this book is useful in crystallising and clarifying the many different 
problems generally subsumed under the heading ‘ maternal deprivation.’ A number 
of the contributors rightly insist on the importance of distinguishing the different
sets of conditions which are commonly and loosely discussed in terms of maternal 
deprivation. Dr. Ainsworth, for example, distinguishes—insufficiency of interaction 
implicit in deprivation, distortion in the character of the interaction without 
respect to its quantity, and the discontinuity of relations brought about through
separation. 

Dr. Ainsworth’s article is, indeed, probably the most important and useful of 
the contributions; certainly it is the longest, occupying more than a third of the 
length of the book. Its excellence lies first in its detailed discussion of the separate 
questions arising from the concept of maternal deprivation; these include the 
questions of multiple mothers, variability in the degree of damage following depriva- 
tion, specific versus general effects of deprivation, diversity in the nature of the 
adverse effects of deprivation, permanence of the effects of deprivation, delinquency,
and maternal versus environmental deprivation ; secondly, in her very full review 
of the various ‘ research strategies’ which have tried to answer these questions. 
This review includes comments on the preceding contributions. 

Her article, as indeed do all the others, includes a very full bibliography.

In summary, a useful book drawing together, as it does, a mass of research 
evidence, presented and interpreted by authorities with widely differing theoretical 
backgrounds. D. Evans. 


APLEY, J., and MACKEITH, R. (1962). The Child and his Symptoms—A
Psychosomatic Approach. Blackwell Scientific Publications, pp. 262, 25s. 


It is not normally the practice of this Journal to review medical publications, 
but a paediatrician colleague suggested that this book would be of considerable 
interest to psychologists who work with children. The book is by two distinguished 
paediatricians and is meant mainly for doctors. It is not a text-book of child 
psychiatry, but rather a comprehensive account of childhood disorders which, it is
rightly claimed, cannot be understood or treated unless the physical aspects of the 
condition are considered alongside the intellectual, emotional and social. The claim 
is not, of course, new and all doctors would claim that they do consider their patients
as whole persons, but they do it by the light of nature and the authors do not think 
this is enough. Rather they point to the need for doctors, particularly those who 
deal with children, to study scientific psychology, for, they point out, “the science
of how humans behave is not just an attitude of mind, but an available collection
of knowledge and techniques.” The importance of correct and complete diagnosis is
stressed throughout, with equal emphasis on the physical and psychological aspects,
for while “the primarily physical disorders of childhood are often transient ... for
psychogenic disorders the prognosis must be more guarded.”

Many, if not most, of the conditions dealt with in this book are those which are
frequently found among children who attend child guidance clinics. They include,
among others, tics, enuresis, recurrent fevers, cyclical vomiting, recurrent headache,
stomach ache and other pains and the various problems associated with feeding.
Often they are dealt with jointly by doctor and psychologist, sometimes by one or
the other; sometimes a psychiatrist or other psycho-therapist is involved. This little
book should do quite a lot to provide a common meeting ground. It will obviously
help the family doctor to see the need for studying his patients more closely and to
be on the look out for psychological concomitances and courses. It will help the
child psychologist to look beyond his discipline and be ready to seek help of a
paediatrician.

The book is written very simply and clearly; case histories and quotations from
the literature are sufficient in number and length to illuminate the text without ever
becoming tedious. The authors are very modest and never dogmatic; they are not
afraid to admit they do not know. They write, for example, “Because of our lack
of experience in treating warts we offer no advice.” They are also fully aware of
what is normal in the physical, social and emotional development of children so that
they are well able to offer caution about those conditions which children may or may
not ‘grow out of.’

There is no doubt that many readers of this Journal will find this a very valuable 
book both for general reading and for reference to special syndromes specific to 
children under their care. L. B. BR. 


GINSBERG, M. (1962). On the Diversity of Morals. London: Heinemann,
pp. xiv+329, 12s. 6d. 

This book is a collection of papers originally published between 1932 and 1955. 
They form what is primarily a study in the philosophy of moral discourse and of 
sociology ; and they cluster round certain themes. Ginsberg is concerned to 
examine what he calls “the possibility of a rational ethic.” He regards the distinc-
tion between questions of fact and of value as valid and important, and he rejects a
naturalist interpretation of ethics. Naturally, therefore, one of the chief problems 
that faces him as a sociologist is how to account for the diversity of morals mani- 
fested by different communities, present and past. This diversity has served in an 
important way to support the doctrine of ethical relativity (namely, “that moral
value is relative to the subject asserting it or to the type of society in which it is
commonly asserted,” with the implication “that there is no rational way of deciding
between differing moral judgments and, indeed, that the distinction between true
and false does not properly apply to them”). Ginsberg is much concerned to show
that the facts of moral variation do not constitute a good, let alone a conclusive,
argument for ethical relativity. He examines the sorts of moral variations that do 
occur—for example, variations arising from differences of opinion and knowledge 
regarding the non-moral qualities of acts or their consequences; and variations 
arising from the possibility of alternative ways of satisfying primary needs. He also 
examines the nature of moral ideals and their relation to our basic needs. He appears 
to argue that we can proceed rationally to decide between differing moral judgments 
very much further than is commonly supposed—by submitting them to rational 
tests such as coherence and comprehensiveness. He says that “it may quite
well be that advance in ethical thought has been hindered by the comparative failure
of the social sciences to throw light on the relevant facts.” But he concludes on a
cautious note. “Whether any fundamental divergences in moral outlook will
remain when the facts have been clarified and the ideals elucidated remains to be
seen.” However, one of the types of variation in morals is that “due to differences
of moral insight and general level of development, moral and intellectual.” Ginsberg
attaches great importance to differences in the level of moral development, and 
this fits in with his argument about the nature of sociology. He maintains that the 
comparative method in sociology is a valid one, and that its further and extended 
use is of great importance. In using this method, sociology will have to face up to its 
immediate tasks of investigating the nature and criteria of development, and of 
arriving at an adequate social morphology. Some of the essays contain a number of 
suggestions about the analysis of the key concepts of social evolution and develop- 
ment. 

I am inclined to think that Ginsberg is quite right in maintaining that we can
get much nearer rational agreement over moral matters than we normally suppose. 
He does not deal, however, with the query : how is he rationally to persuade people 
to adopt an ethic that satisfies his tests of rationality ? If we are interested in achiev- 
ing rational agreement over morals and in controlling the sources of disagreement, 
we will regard lack of knowledge about fact as an important and deplorable state of 
affairs. Unfortunately, we shall not be able to deal with this limitation to our 
capacities until far more social resources have been devoted to the investigation
of fact than seems to be forthcoming at present and in the foreseeable future. 
Ginsberg's view of sociology is an unfashionable one in this period of specialisms and 
ultra-cautious empiricism. But he is almost certainly right to insist that, if sociology
is to be able to generalize at all, it will have to make use of the comparative method.
He is also quite right to emphasise the developmental features of societies. For these
are features that are just apt to be ignored by the current practitioners in the field, 
in spite of the large role they play in current popular and political judgments, and 
in spite of their intrinsic importance. What is especially memorable about these
essays is what they reveal of Ginsberg's own moral ideals. They reveal, in particular,
his belief in the value of sober and objective assessment of issues, and yet also his
deep and passionate humanity, and his faith in the power of human reason to 
contribute to our social development. B. A. FARRELL. 


KASTELL, JEAN (1962). Casework in Child Care. London: Routledge and 
Kegan Paul, Ltd., pp. xi+306, 35s. 


In England and Wales there are more than sixty thousand children in the care
of local authorities. This book describes the statutory provision for their welfare, 
and the work of children’s departments on their behalf. It contains little that is 
new, but it makes an excellent appraisal of the problems and methods of social 
casework and gives a clear picture of its difficulties and limitations. It carefully 
describes the way a child-care officer deals with such problems as taking a child into 
care, placing him with foster-parents and arranging for his adoption, and illustrates 
these procedures by detailed case studies. 

The book is well written, in clear, unemotional language, and shows a deep 
concern for the welfare of deprived children. It shows a sympathetic understanding 
of their needs and those of the adults who work with them. 

It is the best book on the topic known to the present reviewer. M. J. TYERMAN. 


MAYS, J. B. (1962). Education and the Urban Child. Liverpool: Liverpool
University Press, pp. ix+208, 25s.

The appearance of books and articles on the theme of the ‘blackboard jungle’
has focussed attention on the problems of the school in the city slum. Dr. Mays and
his colleagues in Liverpool have studied the question without sensationalism, have
discussed a number of relevant specific issues and have related their observations to
the wider theme of “the relationship between education and social change in the
central area of a modern city” (page 1). In the concluding chapter, the aim is
summarized as “to describe the work of a group of inner city schools in relation to
the general social background and to assess their achievement in promoting the social
objectives explicit and implicit in the 1944 Education Act” (page 178).

The factual material of the report provides useful information in a neglected
area of educational research. It shows, for example, that the schools in this area are
not a ‘blackboard jungle,’ but a reasonably organised educational provision with a
fairly stable teaching force. As the evidence accumulates, we see more clearly what 
is involved in the gigantic task of establishing equal opportunity in areas of social 
disadvantage. 

Specific issues are discussed from time to time: parent-teacher relationships, 
the gulf between the school curriculum and city life, the teacher as a welfare worker, 
the visiting of homes, the growth of the county college. Use of the index will locate a 
number of brief practical essays on important topics set among the factual material. 

The general sociological reflections, consideration of the obstacles to equality 
of opportunity and the nature of the sub-culture of the area, are dealt with mainly 
in the first chapter and the last (which, perhaps, should be read first), but also in
other chapters. 

These three levels of writing on the subject are mixed throughout the text. 
According to their interests, readers will find some parts tedious, but they will also 
find pages to reflect on. Social factors in education have not even yet received the 
attention they deserve. Books on general sociological themes, or on anthropology, 
or social psychology, or sociometry, often lack a clear educational relevance. The 
kind of survey which Dr. Mays has carried out in Liverpool and the kind of report 
he has written make a useful contribution to the literature which teachers and those 
who train teachers should be familiar with. J. D. NISBET. 


MURPHY, L. B., et al. (1962). The Widening World of Childhood. Basic Books,
Inc., pp. xi+399, $10.00.


This is one of a number of publications arising from longitudinal studies of 
normal children. The book deals specifically with the presentation and initial 
analysis of straightforward observations on thirty-two pre-school children placed in 
a number of new situations over a period of time. Such situations included both 
structured and semi-spontaneous types. The authors have sought to discover 
something of the ways in which children of this age cope with strange situations 
until either they attain some degree of mastery or they relinquish both interest and 
attention. Thus, the work is concerned with behaviour analysis, but there is 
incidental reference to the ends attained by individual children. 

Working from the observational material, the authors demonstrate the complex 
interplay of many factors determining the child's reactions to his environment, 
amongst which home influence, past experience, present perceptual and affective 
organisation and temperamental qualities are important. Newer from the point of 
view of psychological analysis are the ensuing attempts to show sequences of 
behaviour accompanying individual progress towards mastery. Parts III and IV 
of the book outline a comprehensive framework of such sequences, showing both 
variety and similarity in children of widely differing talents and backgrounds. 
In one sense, this framework represents a more detailed analysis of Piaget's concepts
of assimilation and accommodation. 

Studies of observational data fulfil an important research role, both in placing 
a perspective on information known to workers through more subjective and less 
organised means, and in supplying hypotheses for further study in a manner which is 
not possible in a more rigidly defined experimental situation. Though such studies 
are both time-consuming and difficult to carry out, each adds considerably to the 
body of knowledge about children, quickly repaying the effort involved. The present 
study offers a further store of material prolific in its interpretations for those con- 
cerned with children at any stage. Yet two advances have still to be made. In this 
investigation, it is pleasing to see the observation of ‘normal’ children—for so
often it is their ‘abnormal’ counterparts who are studied. But the situations in which the children
were observed are still very far from ‘normal’ in any sense. More particularly, all
analysis in the present work, as in all others known to the writer, is qualitative.
Methods of modern statistics have not yet been adapted to furnish tools for an 
adequate appraisal and enrichment of such work. D. M. Luz. 


Pear, T. H. (1962). The Moulding of Modern Man. George Allen and Unwin, 
pp. 220, 21s. 


If Professor Pear had not existed he must surely have been invented! Someone
of wide sympathies and great enthusiasms had to take care that all the subjects 
which intelligent laymen have been concerned about over the past half century 
should not be ignored just because intelligent psychologists have to be so nervously 
cautious in view of the equivocal scientific status of their discipline. But Professor 
Pear’s interests, sensitivities and activities are so multifarious that it would have been 
a project for group research to assemble them in some sort of unity—and one can 
guess what would have happened if a committee had been responsible for the 
enterprise. 

There is no question but that man is moulded by society and moulded by 
conditioning processes both in his formative childhood and in more active maturity, 
processes of which he is only in our century becoming scientifically aware. But 
early philosophers, sociologists and literary critics said as much long ago and 
supported the thesis by cogent argument and evidence. What makes us today so 
sensitive to Vance Packard and Madison Avenue ? Why do we fear the emergence of 
new magicians in the persons of brain-washers, sub-perception operators and sleep- 
indoctrinators, happiness and truth-drug pedlars ? 

Might the psychologists have some responsibility ? They have certainly been 
insufficiently insistent upon one brute empirical fact, unexplained throughout the 
story of human enquiry and probably not susceptible to the process of explication, 
namely that human beings are in some crucial sense active and creative centres of 
that great energy of evolutionary process which ensures that ‘novelty always arrives 
to enrich.’ It is an emphasis on what we can do to control the forces that do so much 
to us and our children that makes Professor Pear's book so precious. But its
discursive style and organisation and its offering of not very full or very well supported
analyses of problems like indoctrination and opinion-making are sometimes impatient-
making. These problems are inextricably bound up with value-judgments and the
dramatic but very special modern cases in ‘Grand Guignol’ dress have not changed
the problems in principle. It was Freeman, in his Social Psychology, who exhibited
as an example of the unconscious ‘moulding of man’ Thorndike’s Arithmetic of
1917. In the second book of this work in two hundred pages there are 643 problems 
which accept as a natural basis the profit motive and the structured private capital-
ism. Freeman suggested problems of the type of “If in a Southern cotton mill,
one out of every hundred has pellagra, how many new cases can be expected if we
increase the number employed by one thousand?” But arithmetic texts containing
such problems would be ‘propagandist’ and not ‘educational.’

The problems which Professor Pear reflects upon are so many and vary so 
much in size and disciplinary complexity that it is perhaps appropriate that the
style is that of the essayist, humane and erudite but hurrying. We are hurried 
through the intellectual mazes of the Nature of the Self, the Relation of the Individ- 
ual to Society, the Concepts of Loyalty, Treachery and the Definitions of Culture 
and the Reality of Progress, with occasional descents into factual reportage of the 
modern kaleidoscope of curious happenings and sensational writings about which it 
is so interesting and so easy for us all to read. Professor Pear makes it even easier, 
but one is troubled as to whether it is always quite a good thing. Is it not increasing
the whole unhealthy bother about the ‘hidden persuaders’ so that we forget our
obligation to adopt values which require us to do some quite conscious thinking, 
some deliberate and avowed persuading and some strenuous and ingenious activity 
in terms of these values? This energetic response from Professor Pear should, of 
course, stimulate something more active than a Grand Guignol ‘frisson’ in his
readers. GEORGE WESTBY.


SCHONELL, F. J., MCLEOD, J., COCHRANE, R. G. (eds.) (1962). The Slow Learner:
Segregation or Integration. St. Lucia, Queensland, Australia: University 
of Queensland Press, pp. 92, 15s. 6d. 

In this symposium—first published in 1960-61, in the University of Queensland’s 
journal The Slow Learner—sixteen contributors from ten countries discuss four 
aspects of the education of subnormal children: the aims—social, scholastic and 
vocational; the relative efficiency, in achieving these aims, of various forms of 
special educational provision ; the identification of those who would best profit by 
special education ; the types of provision in different countries. 

Within their 2,500 words, some chapters give a résumé of research ; others are
descriptive, statistical and historical, showing interesting discrepancies between 
policy in special education and general educational philosophy, for example, in 
England, U.S.A. and the U.S.S.R. The Russians’ insistence on comprehensive 
diagnosis might well have come from a British textbook, but their reservation of
‘remedial schools’ for the organically subnormal raises many problems. Vernon, in
a further contribution to his discussion of the I.Q. as a measure of potential, shows
the difficulty of inference from psychometric data; in criticism of segregation he 
quotes both the evidence that some special school pupils excel their secondary 
modern counterparts, and the suggestion that segregation may increase retardation 
by lowering aspiration. 

Several contributors claim that separate provision improves social integration 
by giving opportunity for guidance in social skills, and by eliminating “possibly the
cruellest form of segregation—social rejection within the ordinary class.” There is
some supporting evidence that the subnormal child is happiest either in a special 
class or in a small normal class. A great limitation in discussion of this problem is 
the dependence even of experienced judgment on the quality of the provision with 
which the writer is familiar ; well-controlled research is needed, on the lines of that 
carried out on gifted children. In the absence of such objective evidence, some of 
the descriptive contributions are naively insistent and repetitive, but Segal’s five 
criteria for successful teaching in the ordinary school are soundly-based and well- 
supported. 

The discussion is introduced and summed up by the deputy director of the 
remedial education centre at the University of Queensland. The problem faced is
not solved—nor is it claimed to be—but the reader will gain (as from any useful 
study in comparative education) protection from the risks of taking the familiar 
solution for granted, and of wasteful trial and error along lines already thoroughly 
explored elsewhere. WILLIAM CURR. 


SCHONELL, F. J., ROE, E., and MEDDLETON, I. G. (1963). Promise and Performance.
University of London Press and University of Queensland Press,
pp. xxv+405, 34s. 


Francis Bacon wrote, “Some books are to be tasted, others to be swallowed, and
some few to be chewed and digested.” This book belongs without question to the
last category, but after the mental digestion, it should be placed on one’s reference 
shelf as a mine of valuable information. Though the book is concerned only with the 
University of Queensland and some of its findings have restricted applicability, there 
is much that has relevance to university problems in this country. 

The enquiry studies the progress of 400 of the students over six years, examining 
their abilities, attainments and attitudes in relation both to their promise on entry 
and their subsequent performance. There are also sections on the functions of the 
staff, including lecturing and tutorials, with a brief but important comment on 
e; ; 
Mention is made here of a few points that are of particular interest. Even in this 
Australian university, with its relatively large intake, the working classes are 
severely under-represented, even when ability and attainment were held constant.
A bi-serial correlation between matriculation score on the one hand and normal
progress or failure to make normal progress on the other gave r = 0.48. A valuable histogram
(p. 44) shows this relationship in detail. The impossibility of drawing any precise
line which would permit almost all successful students to enter and at the same time 
exclude all potential failures is illustrated by the fact that in Matriculation Group 
IV one in every three students had made normal progress, while the proportion was 
one in four for Group V. In individual subjects a significant relationship between 
Matriculation results and first year university results was found in objective subjects 
such as Chemistry, Physics and Mathematics, but not in English, French, History 
or Latin. Only one-third of students who matriculated at the second attempt made 
normal progress later. (This and related findings remind one that there will
inevitably be a strong tendency for university markers to grade their students as
good, average, poor and failures in accordance with the average standard of the student
population in that course, a standard which may change appreciably during a few
years without altering the proportion of failures). 

The relationship between I.Q. score and university performance was only 0.39.
When I.Q. was added to matriculation score there was no improvement in the
predictive power of the matriculation score, though students with both low matricula-
tion and low I.Q. appeared to be a poor risk. Only 11 per cent. of this group gradua-
ted in minimum time, compared with 50 per cent. for other students. Whereas at
Liverpool there was 71 per cent. normal progress, at Queensland it was 49 per cent. 
As one expects, there were wide variations in the failure rates of different Faculties, 
only 35.7 per cent. of students making normal progress in Engineering. In all,
only 22 per cent. of all students who failed in First Year (1950-52 entry), and 13.2
per cent. for the 1955 entry had no further failure. 

One intriguing finding which does not surprise the reviewer but has many 
implications, is that although there was no significant difference between college and 
non-college students in their courses of study, the consensus of the evidence gives
the non-college students the better performance, but readers need to make a
detailed appraisal of the figures. 

This is an important book which advances our knowledge. That it has not
‘solved’ the problem of the selection of university students is not surprising, in
view of the complexity of the problem. Indeed, as the authors realise, it can never
be completely solved. The reviewer forecast some years ago that the ‘break
through’ will come in the field of temperamental characteristics and motivation, and
the findings of the present book have strengthened this belief. R. R. DALE. 


SHIELDS, J. (1962). Monozygotic Twins Brought Up Apart and Brought Up 
Together. Oxford University Press, pp. x+264, 50s.


This study follows the lines of the now classical inquiry by Newman, Freeman 
and Holzinger. But, instead of the nineteen pairs of twins in that inquiry, Dr. 
Shields has at his disposal forty-four pairs of monozygotic twins reared apart, a 
control group of forty-four pairs of monozygotic twins reared together, and a further 
group of thirty-two pairs of dizygotic twins, of whom eleven pairs were brought up 
apart. These twins were selected from volunteers after a B.B.C. appeal for twins 
in 1953. Though some bias in selection is recorded, for example, an excess of women 
and the middle aged, this does not invalidate the inquiry. 

The investigation is reported carefully and objectively, and the conduct of it 
inspires confidence in the author’s scientific integrity. There is no sound of axes 
being ground. It is all the more unfortunate that some aspects of the assessment are 
rather flimsy. The author is rather naive about intellectual assessment. “It might,
therefore, be expected that a non-verbal intelligence test might be a good measure of 
innate capacity in twins, while any intellectual differences due to cultural or educa- 
tional environment might show up more readily on the verbal intelligence test.” 
(p. 58). It is a pity that more extensively used tests of intelligence were not used. 
The same point applies to the assessment of personality. Here Eysenck’s Self-Rating 
Questionnaire for Neuroticism and Introversion-Extraversion has been used. 
While one of the most adequately standardised of these instruments, it also has 
a rather specific area of assessment. It is possible that the writer has recognised 
these difficulties, as he also develops a scheme of degrees of differences of personality
based on personal interview. Perhaps, also, he overestimates the subjective element
in such assessment. To this are added physical measurements and fully recorded case
histories. These case histories occupy almost half the book, and make rich and 
interesting reading. 

And the conclusions? The findings do not differ significantly from those of 
earlier studies in the same field. In the writer's own words, “1.—Family environ-
ments can vary quite a lot without obscuring basic similarity in a pair of genetically
identical twins. 2.—Even monozygotic brought up together can differ quite
widely” (p. 156). The interesting part is the way that there occur exceptions to the
general trends, exceptions which have no apparent explanation. JAMES MAXWELL.


SPEARRITT, D. (1962). Listening Comprehension—A Factorial Analysis.
A.C.E.R., pp. x+148, 42s.


The author of this book points out that as modern societies grow in size and
complexity, the need for effective communication becomes all the more pressing,
and although reading and writing are the more recently developed of the com- 
munication skills they seem to have been more readily accepted as subjects of school 
instruction than have the older skills of listening and speaking. Yet, we have hither- 
to had no factor analytic studies that have included listening comprehension tests. 
This book, which is (in effect) an extremely systematic and thorough Ph.D. thesis, 
not only makes an original contribution to our knowledge about listening compre-
hension, but also serves as a model to research students, particularly in the factorial
field, preparing to write an account of their own investigation. The techniques 
employed are in line with much recent American factor analytic practice, i.e., the 
use of normalised T scores, product moment correlations, principal components, 
oblique analytic rotation (Biquartimin) towards simple structure, and the extraction 
of factors with eigenvalues greater than unity, followed by second order factorisa-
tion. It is interesting to find that the human rotations give a poorer simple structure
than the Biquartimin rotation, yielding hyperplanes of something like 10 per cent. 
less density, with very little compensating advantage in the number of relatively 
high loadings. The results, based on thirty-four tests, are most informative, as few 
of the hypotheses put forward by the author were fully verified. For example, the 
suggestion that there is no factor of listening comprehension separate from reasoning, 
verbal comprehension, attention and memory was rejected. Nor could the hypo- 
thesis be sustained that listening comprehension is related to meaningful memory 
but not to rote memory and span memory. Of course, the results do not imply that 
listening comprehension should be compulsorily taught simply because it has been 
shown to be an independent factor. The case for formal teaching of listening com-
prehension must be decided on its own merits.

There is little to fault in this book except perhaps the statistical appendix, which 
will surely be too elementary for the cognoscenti and too difficult for everyone else.

In summary, this book is recommended, not only for readers interested in
listening comprehension, but also for those concerned with the design of theses on 
experimental topics, particularly in the area of factor analysis. F. W. WARBURTON.


CORRECTION
to article by Crookes and Green in the February issue.


Following line 7 on page 39 it should read: “...dominance. The difficulty
in reading and writing would follow as a result of the...”


TEACHERS' PERSONALITY DESCRIPTION OF
SIX-YEAR-OLDS: A CHECK ON STRUCTURE 


By RAYMOND B. CATTELL 
(University of Illinois) 


SUMMARY. Since factor analytic evidence on personality structure in young
children is still uncommon and incomplete, an independent, checking analysis
is presented on the excellent data gathered by Digman, on 102 seven and eight
year-olds, by use of behavior ratings on a standard set of thirty-nine bipolar
‘marker’ traits.

The general agreement with Digman's findings is good, seven of his eleven 
factors closely matching seven of the twelve here, in ‘blind’ rotations. How-
ever, where there are differences, the technical points by which differences in
factor analyses may be evaluated are illustrated. In particular, the differences of 
interpretation are evaluated, for the present writer's factors A, cyclothymia ; 
C, ego strength ; D, excitability ; E, dominance; and so on to O, guilt prone- 
ness, are more exactly confirmed here. This better confirmation coincides with 
(a) a higher hyperplane count (mean 70 per cent.), and (b) a higher statistical 
significance of simple structure by the Bargmann test. However, the differences 
of the independent analyses are small compared with the agreements. 

The study has relevance also to the cross-cultural constancy of personality 
factors, in that the subjects were roughly half Japanese, half Caucasian, in 
Hawaii. 


I.—THE PRESENT STATUS OF RATING FACTOR RESEARCH.


SINCE Burt's (1940) pioneer factor analyses of ratings of personality traits in
children, the main concepts have been considerably consolidated. Studies 
agree on some twelve to eighteen source traits (Cattell, 1957) which have been 
indexed as A through O, and include such dimensions as cyclothymia-vs- 
schizothymia (A), ego strength-vs-general emotionality (C), dominance-vs- 
submissiveness (E), surgency-vs-desurgency (F), and super ego strength-vs- 
super ego weakness (G). Although the popularly expressed notion that person-
ality will be ‘simpler’ in children may be in some sense true, the inference that
it implies markedly fewer factors has not proved true. For, as the collations of 
results over age by Cattell (1957) have shown, the main factors show continuity 
from pre-school child to adult, the former being classifiable on such dimensions 
as dominance-submissiveness, cyclothymia, surgency, etc., as definitely as 
adults. 

On the other hand, agreement among the three media of observation— 
L-data (life record, rating in situ), Q-data (questionnaire) and T-data (objective,
situational or laboratory test)—has not been so clear. The present writer's 
conclusion (1961) is that the same number and nature of factors appear in 
L-data as Q-data, but that correlations between results from the two sources 
of scores are much reduced by intrusion of ‘instrument factors’ and ‘refraction
factors,’ especially in the rating field, e.g., between parents’ and teachers’
ratings. However, most will agree with such writers as Digman (1963), Pascal 
and Jenkins (1960) and Sells (1961) that clarification of the rating factor situa- 
tion is important, because these personality factors are the real, criterion 
behavior, from which Q- and T-data, as merely test data, must draw their
validity.

Unfortunately, there are still some disagreements on the nature of person-
ality factors in rated everyday life behavior which spring from the failure of
many studies to reach what have long been recognized as minimal satisfactory 
research conditions. As discussed elsewhere (Cattell, 1957) these include : 
(a) use of more than one rater; (b) a position of the rater to permit observation 
of substantial varieties of behavior; (c) clearly, behaviorally-defined traits ; 
(d) carrying over of a sufficiency of factor marker traits from study to study ; 
and (e) thorough factor analytic technique with respect to testing for number of 
factors and rotating to an unquestionably unique simple structure resolution. 

Since the last survey made by the present writer (1957) only three of the 
accessible studies came close to meeting these requirements, namely, those by 
Tupes and Christal (1958), Norman (1959) and Digman (1963). The first two
suffer from under-extraction, but they nicely confirm the factors concluded from
earlier work, as far as they go. The systematic studies by Digman are of 
particular interest, and will be made the basis of analysis here, because (a) the 
data are very soundly gathered ; (b) they contribute to an age level—the 6 to
8 year old level—on which few clear findings exist, except those of Koch (1942), 
Peterson and Cattell (1959), and Cattell and Coan (1957), and which, because of 
the formative importance of these early school years, are of especial importance 
to teachers; and (c) they are gathered in a culture—Hawaii, with about half 
Japanese families—sufficiently different from the results previously collated by 
Cattell (1957), to test the hypothesis hitherto checked only by Q-data (Cattell,
Pichot and Rennes, 1961; Gorsuch, 1963; Tsujioka, in press) that the person-
ality factors A through O are so basic in personality structure that they
essentially transcend cultural and racial differences.


II.—METHODOLOGICAL STANDARDS OF A RE-ANALYSIS. 


The fact that Digman’s research, in spite of its excellent foundations with 
respect to requirements (a) through (d) above, has only somewhat more than 
half its factors agreeing with those of the collated results from other sources, 
justifies a special demonstration of the importance of steps in (e). This article
proposes an independent analysis of Digman's raw data ; a sharp focussing of 
the technical points at issue in the reaching of consistent findings in this field, 
and a crystallizing of the conceptions which can be used and tested in person- 
ality research with young children. 

Digman began by using the comprehensive list of behavior variables 
distilled from many previous researches (Cattell, 1957) and which has been used 
by Coan and others as a marker framework. These thirty-nine bipolar defined 
variables were rated by teachers on 102 seven and eight year old boys and girls, 
by procedures which may be examined in Digman’s account (1963). He used a 
principal axes solution, with estimated communalities, and experimented with 
an orthogonal varimax solution which, as Cattell and Dickman (1962) have tried 
to show, is incompatible with a truly maximal simple structure solution. The 
solution we present here differs from Digman's in (a) concluding that there are 
slightly more factors, and (b) rotating to a simple structure which maximizes the
number of zeros, regardless of whether the final solution presents uncorrelated 
or correlated factors. As a general principle, the present writer maintains that 
orthogonality occurs only as a rare and accidental feature when simple structure 
is single-mindedly pursued. Digman later moved to a final oblique resolution. 

As tests of (a) above—completeness of extraction—we applied to the
factorization of Digman's correlation matrix three tests, namely, those of
Tucker (1938), Sokal (1959) and the Kaiser-Pawlik method.* These converged
on twelve factors. (See Diagram 1.) 
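
The stopping rule of the Kaiser-Pawlik method, as the footnote to this section describes it, may be illustrated by a minimal sketch in Python. It assumes the latent roots of the correlation matrix (with ones in the diagonal) are already to hand. The function names, the tolerance used to judge the curve straight, and the ten illustrative roots are all hypothetical and are not taken from Digman's data.

```python
def roots_above_unity(roots):
    """Count latent roots greater than one (Kaiser's criterion)."""
    return sum(1 for root in roots if root > 1.0)


def factors_before_straight_tail(roots, tol=0.05):
    """Number of roots that precede the final straight-line segment.

    The descending curve is treated as straight from the point after which
    successive drops between neighbouring roots differ by no more than
    `tol` (an assumed tolerance; in practice the plot is judged by eye).
    """
    ordered = sorted(roots, reverse=True)
    drops = [ordered[i] - ordered[i + 1] for i in range(len(ordered) - 1)]
    if not drops:
        return len(ordered)
    start = len(drops) - 1
    # Walk back from the tail while the drops stay (nearly) constant.
    while start > 0 and abs(drops[start - 1] - drops[start]) <= tol:
        start -= 1
    return start


# Hypothetical latent roots for ten variables (they sum to ten).
example_roots = [3.5, 2.0, 1.3, 1.1, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1]
print(roots_above_unity(example_roots))             # 4
print(factors_before_straight_tail(example_roots))  # 4
```

With these invented roots both criteria point to four factors. On real data the two need not coincide, which is presumably why the inflexion is sought near, and not exactly at, a latent root of unity.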


DIAGRAM 1


EVIDENCE ON COMPLETENESS OF FACTOR EXTRACTION: TUCKER'S METHOD.

[Plot not recoverable from the source; horizontal axis: Factors Extracted.]


Communalities estimated by Burt's method (1940) were then inserted and, 
through five iterated factor analyses, were found to converge to fixed values for 
these twelve factors. The resulting unrotated matrix is shown in Table 1. 
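
The iteration of communalities described in this paragraph can be sketched as follows. This is an illustration under stated assumptions, not the procedure actually run: the starting estimate is the largest off-diagonal correlation in each row (a common convenience, not necessarily Burt's method), latent roots are found by plain power iteration with deflation, and the leading roots are assumed positive and dominant. All function names and the four-variable, one-factor example are hypothetical.

```python
import math


def top_eigenpairs(matrix, k, sweeps=200):
    """k largest-magnitude eigenpairs of a symmetric matrix.

    Uses power iteration with deflation; assumes k does not exceed the
    rank of the matrix.
    """
    n = len(matrix)
    work = [row[:] for row in matrix]
    pairs = []
    for _ in range(k):
        vec = [1.0 / (i + 1) for i in range(n)]  # arbitrary start vector
        for _ in range(sweeps):
            prod = [sum(work[i][j] * vec[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in prod))
            vec = [x / norm for x in prod]
        prod = [sum(work[i][j] * vec[j] for j in range(n)) for i in range(n)]
        root = sum(vec[i] * prod[i] for i in range(n))  # Rayleigh quotient
        pairs.append((root, vec))
        for i in range(n):  # deflate: remove the component just found
            for j in range(n):
                work[i][j] -= root * vec[i] * vec[j]
    return pairs


def iterate_communalities(corr, k, tol=1e-6, max_iter=500):
    """Refactor repeatedly until the k-factor communalities stop changing."""
    n = len(corr)
    # Assumed starting guess: largest absolute off-diagonal value per row.
    h2 = [max(abs(corr[i][j]) for j in range(n) if j != i) for i in range(n)]
    for iteration in range(1, max_iter + 1):
        reduced = [row[:] for row in corr]
        for i in range(n):
            reduced[i][i] = h2[i]  # communalities replace the unities
        pairs = top_eigenpairs(reduced, k)
        new_h2 = [sum(root * vec[i] ** 2 for root, vec in pairs) for i in range(n)]
        change = max(abs(a - b) for a, b in zip(new_h2, h2))
        h2 = new_h2
        if change < tol:
            return h2, iteration
    return h2, max_iter


# Hypothetical one-factor example: correlations generated from four loadings.
loadings = [0.9, 0.8, 0.7, 0.6]
corr = [[1.0 if i == j else loadings[i] * loadings[j] for j in range(4)]
        for i in range(4)]
h2, n_iter = iterate_communalities(corr, k=1)
print([round(h, 3) for h in h2], n_iter)
```

In this contrived case the communalities settle on the squared loadings used to build the matrix. The sketch does not guard against communalities exceeding unity, which a full program would need to do.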


The search for simple structure was pursued in three steps of successive
refinement: (1) by the Oblimax analytical computer program (Pinzka and 
Saunders, 1954) ; (2) by the correction of certain systematic faults in analytical 
programs through a tidying by the Maxplane program (Cattell and Muerle, 1960); 
(3) by a final hand polishing, maximizing the precision of the already clear 
hyperplanes by the Rotoplot (oscilloscope plot) program. Although the last is 
by hand and eye, it is kept, in the true scientific sense, ' blind,' since variables 
are unnumbered. The investigator thus cannot let his prejudices guide him, and 
the end result is a truly independent piece of evidence on personality factor 
structure. 


* Tucker's method has hitherto been most used by the present writer, but it disregards
the size of sample, and Sokal has shown (1959) that his method gives better results on
empirical data for which the number of factors is known. Pawlik’s modification of Kaiser’s
method, worked out in this laboratory, is to insert ones for communalities, to plot sizes of
successive latent roots, and to stop (usually in the neighborhood of a latent root of unity)
when a point of inflexion occurs as the descending curve becomes a straight line. It is the
best method we have yet found, in agreeing with previous known answers.
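As a rough illustration of the latent-root procedure described in this note, the sketch below computes the roots of a correlation matrix with unities left in the diagonal, counts those above unity, and returns second differences of the descending curve. The function names are hypothetical and NumPy is an assumed tool. Locating the point of inflexion is left to inspection of the plotted roots, so this is not a full implementation of Pawlik's modification.

```python
import numpy as np


def latent_roots(R):
    """Latent roots of R with ones kept as communalities, largest first."""
    R = np.asarray(R, dtype=float)
    return np.sort(np.linalg.eigvalsh(R))[::-1]


def roots_above_unity(roots):
    """Number of latent roots greater than one (Kaiser's rule of thumb)."""
    return int(np.sum(np.asarray(roots, dtype=float) > 1.0))


def second_differences(roots):
    """Second differences of successive roots.

    Values close to zero suggest that the descending curve has become
    a straight line over that stretch.
    """
    return np.diff(np.asarray(roots, dtype=float), n=2)
```

Plotting the output of latent_roots against factor number gives the curve the note refers to; the count from roots_above_unity marks the neighborhood in which the stopping point is usually sought.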
TABLE 1


PRINCIPAL AXES FACTORS. 
TABLE 2


TRANSFORMATION MATRIX. 
It is also methodologically important to point out that: (a) a simple structure
resolution draws its objectivity not from machine handling, but from the fact of 
a unique maximum having been attained in the hyperplane count. In the present 
case a very thorough search through fourteen overall rotations after the oblimax 
and maxplane machine programs was carried out with results shown in Table 2. 
It will be seen that a mean value of 70 per cent. in the hyperplane was reached, 
with no single factor below 59 per cent. ; (b) that being objective and maximal 
is not enough. It must also be significant. The significance of the simple 
structure was tested here by Bargmann’s test (1954). This test, as argued 
elsewhere (Cattell and Pawlik, in press), needs improvement since it makes 
assumptions which in some situations result in under-estimation of significance. 
By this criterion one factor, No. 3, is non-significant and two, Nos. 5 and 8, are 
only at the 10 per cent. level, but the rest are mainly at P < .01 or P < .001. The
rotation by Digman, which we are compelled to consider a less acceptable
alternative interpretation of the factors to that given here, has, by the same
criterion, four non-significant (his Nos. 1, 2, 3 and 4), two at .25 (his Nos. 7 and
8) and the remaining five at the same level as our acceptable nine.
TABLE 3A
PRIMARY FACTOR PATTERN.

TABLE 3B


LIST OF VARIABLES IN TABLE 3A


Non-aggressive, kind, considerate versus aggressive, tends towards fighting, 
bullying, teasing, cruelty. 
Unpopular, generally disliked by other children versus popular, generally liked 
by other children. 
Poor general health, prone to absence by reason of illness, or physical complaints
versus of generally good health.
Quiet versus noisy, distracting in class.
Free of jealousy, feels accepted versus prone to jealousy. 
Learns slowly versus learns fast. 
Conscientious, trustworthy versus untrustworthy, dishonest. 
Self-sufficient, independent versus dependent on teacher. 
Submissive, follows lead of other children versus self-assertive, tends to dominate 
other children. 
Prefers not to be noticed versus demanding of teacher’s attention.
Well poised, tough, sticks up for own rights when threatened versus easily upset,
overwhelmed by teasing of other children, yields easily to persuasion.
Lacking in self-confidence, easily discouraged or defeated versus confident
(perhaps over-confident) of own ability and ideas.
Placid, free from distress versus fearful, worrying, anxious. 
Calm, relaxed versus over-active, excitable, perhaps irritable. 
Cheerful versus depressed. 
Changeable in interests, attitudes, opinions versus stable in interests, attitudes, 
opinions. 
Inattentive, absent-minded, prone to day dreaming, shows poor concentration 
versus very attentive to class proceedings. 
Lethargic, apathetic, easily tired or fatigued versus alert, wide-awake, energetic. 
Responsible versus irresponsible, frivolous. 
Quitting, fickle versus persevering, determined.
Socially awkward and clumsy versus polished in manner.
Practical-minded versus imaginative.
Self-abasive, deferent, minimizes own importance versus self-centred, conceited,
boastful, ‘ show-off.’
Lacks nervous habits versus prone to nervous habits (e.g., thumb-sucking, nail-
biting, pulling and twisting hair, grimacing).
Neat, tidy, orderly versus untidy, careless with respect to appearance of self and
belongings.
Adaptable, flexible versus rigid, has difficulty adjusting to changes or new 
situations. 
Careful with property of others versus careless, destructive of property of others. 
Secretive, reserved versus expressive, frank. 
Lacking in artistic feeling versus aesthetically sensitive, aesthetically fastidious.
Has difficulty following instructions versus follows instructions easily and
accurately.
Parents appear generally accepting toward child versus parents appear generally
rejecting toward child.
Parents appear relatively permissive, allow child to make independent decisions
versus parents appear dominating, over-protective toward child.

Shy, bashful, seclusive, aloof, remains fairly isolated from other children versus 
outgoing, mixes freely with other children. 

Associates mostly with children of opposite sex versus associates mostly with 
children of own sex. 

Prefers solitary pursuits versus gregarious, prefers games involving many 
children. 


Negativistic, stubborn, disobedient, argumentative versus co-operative,
compliant, obedient.

Trustful of others, readily accepts solicitude of others as sincere versus sus-
picious of others, ungrateful, rejects affection or solicitude.

Retiring, cautious versus adventurous, bold, willing to take the chance of 
possible rejection or injury. 

Female versus male. 
Possibly the hyperplane on our unsatisfactory factor No. 3 will be improved
by some investigator trying further shifts, but perhaps also it is the nature of
Ego Strength (see Table 6) that it affects too much of the personality to leave
any marked hyperplane. At any rate its match to the pattern obtained with
the maximum simple structure position in other studies is good. The accepted
unique resolution for all factors is shown in Table 3A, but, for convenience, the
labels for the variables (positive pole to the right) in it are listed in Table 3B.
It should be noted that Table 3A is not left in the reference vector system (in
which the ±.10 hyperplane count was taken) but in the true factor system,
and the intercorrelations given below are similarly those of the primary factors,
and are greater than for the reference vectors.
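The hyperplane count used as the criterion of simple structure can be sketched as follows. The sketch assumes a reference vector structure matrix with variables in rows and factors in columns, and takes the ±.10 band named in the text as the hyperplane width. The function name and the NumPy implementation are illustrative assumptions, not the procedure of the Maxplane or Rotoplot programs.

```python
import numpy as np


def hyperplane_count(V, width=0.10):
    """Percentage of variables lying in each factor's hyperplane.

    V     : reference vector structure (variables x factors)
    width : half-width of the hyperplane band (the text uses .10)

    Returns the per-factor percentages and their mean.
    """
    V = np.asarray(V, dtype=float)
    in_plane = np.abs(V) <= width              # True where a loading is effectively zero
    per_factor = 100.0 * in_plane.mean(axis=0)
    return per_factor, float(per_factor.mean())
```

Applied to a rotated solution, the second returned value corresponds to the kind of mean percentage quoted in the text (70 per cent.), and the first to the per-factor figures (none below 59 per cent.).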


III.—INTERPRETATION OF FACTORS.


It is our purpose now to compare the meanings of the factors here obtained 
with those of previous studies. For convenience their order has been made the 
same in Table 3 as that of the letter index system used in dozens of other 
published studies in this area, and the axes have been reflected to put the 
positive signs in the familiar direction. Only the top six to eight variables are 
listed, corresponding to those clearly outside the hyperplane, and for the reader's 
convenience, the verbal labels have been adjusted to signs, i.e., by reading
straight down the right-hand side one will get the labelled behaviors as they go 
together. 


TABLE 4 


Factor 1—SIZOTHYMIA versus AFFECTOTHYMIA, A.


Trait Description 





Negativistic, stubborn, disobedient, argumentative versus
co-operative, compliant, obedient.

Suspicious of others, ungrateful, rejects affection or solicitude
versus trustful of others, readily accepts solicitude of others
as sincere.

Aggressive, tends toward fighting, bullying, teasing, cruelty
versus non-aggressive, kind, considerate.

Untrustworthy, dishonest versus conscientious, trustworthy.

Self-assertive, tends to dominate other children versus
submissive, follows lead of other children.

Rigid, has difficulty adjusting to changes or new situations 
versus adaptable, flexible. 

Prone to jealousy versus free of jealousy, feels accepted. 

Self-centred, conceited, boastful, ‘show-off’ versus self- 
abasive, deferent, minimizes own importance. 
The markers for cyclothymia-vs.-schizothymia, namely co-operative,
trustful and adaptable, occur here and nowhere else. This, together with its
large size (A tends to rank as the first factor in size, as its original symbolization
indicates) defines it. The only problem here is whether the Protension factor, L,
is mixed with this (L is otherwise missing) and might have separated if an extra
dimension had been extracted. The s index of similarity puts the match of this
with A in the Coan (1957) study beyond the P < .01 value. We believe
sizothymia-vs.-affectothymia would, in the future, be a better label for this factor.*
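The paper does not state the formula for the s index. As a hedged illustration only, the sketch below follows one later-published form of the salient variable similarity index: each variable is classed on each factor as a positive salient, a negative salient, or a hyperplane variable, and s is the excess of sign agreements over sign disagreements among salients, divided by those counts plus half the cases salient on one factor only. The cutoff of .10, the function name and the NumPy implementation are assumptions. The P values quoted in the text come from significance tables that are not reproduced here.

```python
import numpy as np


def salient_similarity(a, b, cutoff=0.10):
    """Salient variable similarity between two factor loading columns.

    a, b   : loadings of the same variables on the two factors compared
    cutoff : absolute loading at or above which a variable counts as salient
             (an assumed value; choose it to suit the study)
    """
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)

    def classify(x):
        # +1 positive salient, -1 negative salient, 0 in the hyperplane
        return np.where(x >= cutoff, 1, np.where(x <= -cutoff, -1, 0))

    ca, cb = classify(a), classify(b)
    same_sign = np.sum((ca != 0) & (ca == cb))        # salient on both, signs agree
    opposite = np.sum((ca != 0) & (ca == -cb))        # salient on both, signs differ
    one_sided = np.sum((ca != 0) ^ (cb != 0))         # salient on one factor only
    denominator = same_sign + opposite + 0.5 * one_sided
    if denominator == 0:
        return 0.0                                    # no salients on either factor
    return float((same_sign - opposite) / denominator)
```

The index runs from +1 for identical salient patterns to -1 for fully reflected ones, so a reflected factor should be reversed in sign before matching.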


TABLE 5 


Factor 2—INTELLIGENCE, B. 


Trait Description. 





Learns slowly versus learns fast.

Has difficulty following instructions versus follows instructions
easily and accurately.

Self-abasive, deferent, minimizes own importance versus
self-centred, conceited, boastful, ‘ show-off.’

Inattentive, absent-minded, prone to day dreaming, shows
poor concentration versus very attentive to class proceedings.

Lethargic, apathetic, easily tired or fatigued versus alert,
wide-awake, energetic.

Dependent on teacher versus self-sufficient, independent.

Free of jealousy, feels accepted versus prone to jealousy.

Lacking in self-confidence, easily discouraged or defeated
versus confident.





This is clearly Spearman's ' g,' picked up in ratings, as it often is, with the 
associated traits we are accustomed to find with it in the classroom situation, 
e.g., lack of self-confidence and inattentiveness at the low intelligence pole. 
The symbol B has been used for ' g ' in the total personality expression, reserving 
the latter for the strictly cognitive measurement. 


* Because of its excellent confirmation of the Bleuler and Kretschmer schizothymia-
cyclothymia dimension, Factor A has been so named since its discovery in 1945 (Cattell).
Unfortunately, teachers and parents have associated the term schizothymia more with the
schizophrenic abnormality which can attack schizothymes than with the normal and
effective temperament. After fifteen years of battling with this misapprehension, we are
suggesting that the A factor axis be relabeled as sizothymia-vs.-affectothymia (or cyclothymia).
This stresses the emotional detachment or flatness of A (-) (sizo from assidere, as in the root
for painter's size, to make colors ‘ lie flat ’). At the opposite pole, since in the normal range
the affective aspect of this temperament is more prominent than the cycloid mood swings,
it would improve the A (+) reference to call it affectothymia (agreeing, in abnormal projections,
with the currently favored designation ‘ affective ’ rather than ‘ cyclic ’ for the affective
psychoses). The clearer sound distinction of sizothymic and affectothymic would also be a
contribution to oral discussion.
TABLE 6


Factor 3—GENERAL EMOTIONALITY versus EGO STRENGTH, C.


Trait Description. 


Quitting, fickle versus persevering, determined. 

Associates mostly with children of opposite sex versus associ- 
ates mostly with children of own sex. 

Demanding of teacher's attention versus prefers not to be 
noticed. 

Changeable in interests, attitudes, opinions versus stable in
interests, attitudes, opinions.

Overactive, excitable, perhaps irritable versus calm, relaxed.

Noisy, distracting in class versus quiet.

Self-centred, conceited, boastful, ‘ show-off’ versus self- 
abasive, deferent, minimizes own importance. 

Parents appear relatively permissive, allow child to make 
independent decisions versus parents appear dominating, 
over-protective toward child. 

Negativistic, stubborn, disobedient, argumentative versus
co-operative, compliant, obedient. 





One encounters here the usual difficulty in separating C, ego strength, from 
G, super ego strength (since both typically share perseverance), on the one hand, 
and O (-) factor on the other (since C (-) and O typically share ‘ worrying,
anxious ’). Persevering, calm, stable and co-operative (lowest) are taken as the
best definition of C, and no other pattern here has these, though Factor 4, D,
below has some claim, and the closeness to O (-), Factor 12, is mitigated only by
the latter’s emphases on specifically fearful reactions, and its absence of general 
emotionality-vs.-calmness. If the reader sets this beside the C pattern as given 
in the Child Personality Questionnaire (Porter and Cattell, 1960, p. 26) however, 
he will immediately see the closeness of match. 


TABLE 7 


Factor 4—DEPRIVATION versus EXCITEMENT, D.


Parents appear relatively permissive, allow child to make
independent decisions versus parents appear dominating,
over-protective toward child.

Parents appear generally accepting toward child versus
parents appear generally rejecting toward child.

Lacks nervous habits versus prone to nervous habits (e.g.,
thumb-sucking, nail-biting, pulling and twisting hair,
grimacing).

Calm, relaxed versus overactive, excitable, perhaps irritable.

Adventurous, bold, willing to take the chance of possible
rejection or injury versus retiring, cautious.

Lethargic, apathetic, easily tired or fatigued versus alert,
wide-awake, energetic.

Lacking in self-confidence, easily discouraged or defeated
versus confident (perhaps over-confident) of own ability
and ideas.

Placid, free from distress versus fearful, worrying, anxious. 
Normally the central feature of D is excitability, with assertive attention-
getting. The mixture of excitable with energetic, confident, yet nervous and
fearful comes nearest here to the usual D. Also the tendency to pervade the
whole personality (only 62 per cent. in hyperplane) more than other factors
(except F and G) agrees. The new variables here, dominating and rejecting
parent, raise crucial issues for further research on its nature.


TABLE 8 


Factor 5—SUBMISSIVENESS versus DOMINANCE, E.


Trait Description. 





Female versus male. 

Non-aggressive, kind, considerate versus aggressive, tends 
toward fighting, bullying, teasing, cruelty. 

Calm, relaxed versus overactive, excitable, perhaps irritable. 

Inattentive, absent-minded, prone to day dreaming, shows
poor concentration versus very attentive to class proceedings.

Quiet versus noisy, distracting in class. 

Dependent on teacher versus self-sufficient, independent. 

Practical minded versus imaginative. 

Polished in manner versus socially awkward, and clumsy. 








The only factors with which dominance is occasionally confused are D,
Excitability, above, and I (-), Harria (Table 12) below. Both E and I (-) are
substantially sex (male) correlated in every group yet studied. The association
of aggressiveness, activity, self-sufficiency and ‘ socially clumsy and awkward ’
(‘ unconventional ’ in adults) suggests this is dominance (while ‘ esthetic ’
marks I as different).


TABLE 9 


Factor 6—DESURGENCY versus SURGENCY, F. 


Trait Description. 


Unpopular, generally disliked by other children versus
popular, generally liked by other children.

Depressed versus cheerful.

Associates mostly with children of opposite sex versus
associates mostly with children of own sex.

Lethargic, apathetic, easily tired or fatigued versus alert,
wide-awake, energetic.

Prone to jealousy versus free of jealousy, feels accepted.

Easily upset, overwhelmed by teasing of other children, yields 
easily to persuasion versus well poised, tough, sticks up for 
own rights when threatened. 
The only problem usually met in recognizing the F pattern is its separation
from H, since both involve active social behavior. But there is no difficulty
here since the markers cheerful and energetic so clearly predominate. The s
match of this pattern with Coan's study E is at P < .02 and with Coan’s H at
P < .25.


TABLE 10 


Factor 7—SUPEREGO STRENGTH, G.


Trait Description. 





Untidy, careless with respect to appearance of self and
belongings versus neat, tidy, orderly.

Irresponsible, frivolous versus responsible.

Careless, destructive of property of others versus careful
with property of others.

Quitting, fickle versus persevering, determined.

Changeable in interests, attitudes, opinions versus stable in
interests, attitudes, opinions.

Socially awkward, and clumsy versus polished in manner.

Inattentive, absent-minded, prone to day dreaming, shows
poor concentration versus very attentive to class proceedings.
Untrustworthy, dishonest versus conscientious, trustworthy. 





This has all the marks of the super ego pattern, G, and needs no discussion. 


TABLE 11 


Factor 8—THRECTIA versus PARMIA, H. 


Trait Description. 


Prefers solitary pursuits versus gregarious, prefers games
involving many children.

Retiring, cautious versus adventurous, bold, willing to take
the chance of possible rejection or injury.

Secretive, reserved versus expressive, frank.

Shy, bashful, seclusive, aloof, remains fairly isolated from
other children versus outgoing, mixes freely with other
children.

Associates mostly with children of own sex versus associates
mostly with children of opposite sex.

Neat, tidy, orderly versus untidy, careless with respect to
appearance of self and belongings.

Lethargic, apathetic, easily tired or fatigued versus alert,
wide-awake, energetic.

Socially awkward, and clumsy versus polished in manner.





This has all the typical marks of H and its characteristic large variance 
contribution (including even the “ tendency to mix more freely with the opposite 
sex,” usually clearer at later ages, in this factor). The s index with the average
pattern in Cattell’s earlier collation for the pattern is P < .05 and with Coan’s
P < .05.
TABLE 12
Factor 9—HARRIA versus PREMSIA, I. 


Trait Description 





Male versus female 

Lacking in artistic feeling versus aesthetically sensitive, 
aesthetically fastidious. 

Prefers not to be noticed versus demanding of teacher's 
attention. 

Practical minded versus imaginative. 

Submissive, follows lead of other children versus self-
assertive, tends to dominate other children.

Secretive, reserved versus expressive, frank.

Well poised, tough, sticks up for own rights when threatened
versus easily upset, overwhelmed by teasing of other
children, yields easily to persuasion.

Shy, bashful, seclusive, aloof, remains fairly isolated from 
other children versus outgoing, mixes freely with other 
children. 

Free of jealousy, feels accepted versus prone to jealousy. 





Again, the match to the previous concepts and empirical patterns for the 
factor is excellent. Out of feminine, esthetically sensitive, imaginative, demand- 
ing, easily upset, dependent and jealous only dependent is here missing. This is 
the typical environmentally produced pattern called premsia (as an acronym 
for protected emotional sensitivity). Masculinity-femininity has a disproportionate
loading because of its perfect reliability.


TABLE 13 


Factor 10—COASTHENIA, J.


Trait Description 


Poor general health, prone to absence by reason of illness, or
physical complaint versus of generally good health.

Self-assertive, tends to dominate other children versus
submissive, follows lead of other children.

Popular, generally liked by other children versus unpopular,
generally disliked by other children.

Inattentive, absent-minded, prone to day dreaming, shows
poor concentration versus very attentive to class proceedings.

Follows instructions easily and accurately versus has
difficulty following instructions.

Trustful of others, readily accepts solicitude of others as
sincere versus suspicious of others, ungrateful, rejects
affection or solicitude.

Gregarious, prefers games involving many children versus
prefers solitary pursuits.

Associates mostly with children of opposite sex versus
associates mostly with children of own sex.
This is a pattern of such low loadings that it is almost certain to have its
‘salients’ obscured by erratics. It has been tentatively identified as J, 
Coasthenia (see Porter and Cattell, 1960, p. 33) which is marked by individual- 
ism, unpopularity, and lack of zest ; but the match is not good and a second, 
but still poor possibility is L, Protension. It must await more intensive represen- 
tation of variables crucial to its clarification. 




TABLE 14 


FACTOR 11—PRAXERNIA versus AUTIA, M. 


Trait Description 





Lacking in artistic feeling versus aesthetically sensitive, 
aesthetically fastidious. 

Practical-minded versus imaginative.

Gregarious, prefers games involving many children versus
prefers solitary pursuits.

Of generally good health versus poor general health, prone to
absence by reason of illness or physical complaints.

Lacks nervous habits versus prone to nervous habits (e.g.,
thumb-sucking, nail-biting, pulling and twisting hair,
grimacing).

Learns slowly versus learns fast.

Neat, tidy, orderly versus untidy, careless with respect to
appearance of self and belongings.

Associates mostly with children of own sex versus associates 
mostly with children of opposite sex. 





‘ Imaginative ' and ‘ esthetically fastidious ' typically load both factor I, 
Premsia, and factor M, Autia. That this factor is M is indicated by I being 
already present (Table 12 above), and by the fact that this lacks the traits of 
over-protectiveness characteristic of premsia. At the same time it has the 
essential meaning of Autia, as ' autistically wrapt up in one's own ideas, and 
lacking in practicality ' as shown by ' prefers solitary pursuits,' ' imaginative ' 
(this can be all things to all raters, but in this context, it evidently has the mean- 
ing of a vivid internal life), and ' careless of appearance.’ 

While this pattern has the main markers for the O factor of guilt proneness— 
notably ‘ worrying,’ ' easily discouraged,’ ‘ easily upset,’ and ' dependent ’—it 
also has some at lower loadings which normally go with protension (L factor or 
* paranoid tendency ’), such as ' prone to jealousy,’ ' physical complaints ' and 
' easily upset.' There are two possibilities : (a) that one more factor should have 
been extracted, so that possibly this factor would have split into a larger
O and a smaller L ; (b) that in younger children a common pattern exists out 
of which guilt proneness persists later, as a direct development, and the paranoid 
' protension ' develops as a special reaction formation modification. This is an 
interesting theme which could be handled by a concentration of factor analytic 
research in this area. 
TABLE 15


Factor 12—GUILT DEPRESSION OR SUPEREGO PRONENESS, O.


Trait Description 








Placid, free from distress versus fearful, worrying, anxious. 

Adaptable, flexible versus rigid, has difficulty adjusting to 
changes or new situations. 

Confident (perhaps over-confident) of own ability and ideas
versus lacking in self-confidence, easily discouraged or
defeated.

Of generally good health versus poor general health, prone to
absence by reason of illness, or physical complaints.

Prefers not to be noticed versus demanding of teacher’s
attention.

Well poised, tough, sticks up for own rights when threatened
versus easily upset, overwhelmed by teasing of other
children, yields easily to persuasion.

Self-sufficient, independent versus dependent on teacher.

Free of jealousy, feels accepted versus prone to jealousy.

Lacks nervous habits versus prone to nervous habits (e.g.,
thumb-sucking, nail-biting, pulling and twisting hair,
grimacing).





IV.—SUMMARY AND CONCLUSION. 


A factor analysis of checked personality rating data, on a standard and 
comprehensive set of variables, making systematic comparisons possible with 
researches on other samples and ages, has been described. The study is presented 
partly to show how more intensive technical attention to checks on (a) the 
number of factors to extract, and (b) the significance of the achieved simple 
structure, can produce closer agreement of factor analytic studies than has 
often prevailed, and can contribute basically to our concepts of child personality. 


For the present research reaches a higher and virtually unimprovable 
hyperplane count compared with the alternative published analysis of the same 
data. Except for discrepancies in the two points named, the alternative study 
stands at a high technical level among experiments in this field and ends in 
factor patterns seven out of ten of which match seven present here. However, 
the present study gives very close matches indeed, in every factor (except 
possibly for J), to the standard list, indexed A through O, representing the
collation of previous researches (Cattell, 1957). These twelve factors fit
those found with children, by Coan (1957), Peterson and Cattell (1959), jo
(1957), Koch (1942) and others, but also have clear correspondences with twelve of
the sixteen most fully described adult factors (Cattell, 1946; Degan, 1952;
Huffman, 1957; Lorr, 1953; Norman, 1959; Thurstone, 1934; Tupes, 1958;
and Wittenborn, 1951). 

Unfortunately, the use of rating factors in experimental studies to evaluate 
maturationally and environmentally determined changes in individual children 
suffers from severe practical difficulties, due to the incomparability of different 
raters and observation situations. It is, therefore, particularly encouraging to 
find here that the behavior rating factors agree in number and meaning with 
those in a practical measuring instrument—the Child Personality Questionnaire
(Porter, 1960)—derived from similar factoring of questionnaire descriptions of
behavior. Each of these factors is also found, though with two additions
appearing with increasing age, in the High School Personality Questionnaire
(Cattell and Beloff, 1960). However, as the present writer has pointed out, with
more experimental and conceptual background, elsewhere (1961), this does not
necessarily imply that the correlation should be unity between corresponding 
questionnaire and rating factors, because of the intrusion of instrument factors 
into the variance of both rating and questionnaire variables. 


In view of the discrepancies which exist between the Digman study and the 
present one, in the meaning of a few smaller factors, it is most desirable that 
further independent studies with adequate rating conditions, on the same
variables, should be carried out at this 6-8 year old level. Only then will it be 
timely to debate and experiment upon the hypotheses recently put forward 
(Cattell, 1957 ; Digman, 1963) for these behavior patterns. 


However, in thus critically focussing on lesser discrepancies, one must not
overlook the main conclusion, which is one of agreement and, indeed, of
agreement across the appreciable cultural gap between the present writer's
Caucasian subjects in the Mid-West and Digman's more than 50 per cent. racially
Mongolian subjects in Hawaii. The recent cross cultural comparisons of
independent simple structure factor analyses of the Sixteen Personality Factor
Questionnaire (Cattell and Stice, 1962) in France (Cattell, Pichot and Rennes,
1961), Italy, Japan and the U.S.A. show clearly that such basic source traits as
cyclothymia-sizothymia, dominance, surgency, ego strength, etc., described in 
ratings above, persist in essential form in all cultures investigated. It is part 
of the special value of the Digman data that it contributes towards checking, 
in the rating field, this theory that the A through O factors are common to 
humanity. 


V.—REFERENCES. 


BARGMANN, R. (1954). Signifikanzuntersuchungen der einfachen Struktur. Mitt. für
Math. Stat., Sonderdruck. Physica-Verlag.

BURT, C. L. (1940). Factors of the Mind. London: University of London Press.

CATTELL, R. B. (1945). The description of personality. Amer. J. Psychol., 58, 69-90.

CATTELL, R. B. (1946). The Description and Measurement of Personality. New York:
Yonkers, World Book.

CATTELL, R. B. (1957). Personality and Motivation Structure and Measurement. New
York: Harcourt, Brace and World.

CATTELL, R. B. (1961). Theory of situational, instrument, second order and refraction
factors in personality structure. Psychol. Bull., 58, 160-174.

CATTELL, R. B., and BELOFF, H. (1960). The High School Personality Questionnaire.
National Found. for Educ. Res. London: Wimpole Street.

CATTELL, R. B., and COAN, R. A. (1957). Child personality structure as revealed in
teachers’ ratings. J. Clin. Psychol., 13, 315-327.

CATTELL, R. B., and DICKMAN, K. (1962). A dynamic model of physical influences
demonstrating the necessity of oblique simple structure. Psychol. Bull., 59, 389-400.

CATTELL, R. B., and MUERLE, J. L. (1960). The ‘ maxplane ’ program for factor rotation
to oblique simple structure. Educ. Psychol. Measmt., 20, 569-590.

CATTELL, R. B., PICHOT, P., and RENNES, P. (1961). Constance interculturelle des
facteurs de personnalité mesurés par le test 16 P.F. II.—Comparaison franco-
américaine. Revue de Psychologie Appliquée, 11, 165-196.

CATTELL, R. B., and PAWLIK, K. (1963). A new test of significance of simple structure.
(Laboratory Rept., Univ. of Illinois, Lab. Person. Assess.)

CATTELL, R. B., and STICE, G. F. (1962). The Sixteen Personality Factor Questionnaire.
N.F.E.R., London.

DEGAN, J. W. (1952). Dimensions of Functional Psychosis. Richmond, Va.: Byrd Press.

DIGMAN, J. (1963). The principal dimensions of child personality as inferred from
teachers’ judgments. Child Development, 34, 43-60.




GORSUCH, R. L. (1963). A culturally sampled check on the second order structure among
questionnaire personality primaries. (In Press).

HUFFMAN, P. W. (1957). See Cattell, R. B., ref. 1957, above.

KOCH, H. L. (1942). A factor analysis of some measures of the behavior of pre-school
children. J. Gen. Psychol., 27, 257-287.

LORR, M., RUBENSTEIN, E., and JENKINS, R. L. (1953). A factor analysis of personality
ratings of out-patients in psychotherapy. J. Abnorm. Soc. Psychol., 48, 511-514.

NORMAN, WARREN T. (1959). A check on behavior rating factors in adults. J. Abnorm.
and Soc. Psychol. In Press.

PASCAL, G. R., and JENKINS, W. O. (1955). A study of early environment and its
relation to adult behavior. J. Abnorm. Soc. Psychol., 43, 279-284.

PETERSON, D. R., and CATTELL, R. B. (1959). Personality factors in nursery school
children as derived from teachers’ ratings. J. Consult. Psychol., 23, 562.

PINZKA, C., and SAUNDERS, P. R. (1954). Analytic rotation to simple structure: II.—
Extension to oblique solution. (RB-54-31) Educational Testing Service Bulletin
(Multilithed).

PORTER, R., and CATTELL, R. B. (1960). The Child Personality Questionnaire.
Champaign, Ill.: Institute of Person. and Abil. Testing, 1602, Coronado Drive.

SELLS, S. B. (1961) (Edit.). The dimensions of stimulus situations which account for
behavior variance. Symposium No. 1, Psychology Department, Texas Christian
University, Texas.

SOKAL, R. R. (1959). A comparison of five tests for completeness of factor extraction.
Trans. Kansas Acad. Sci., 62, 141-152.

THURSTONE, L. L. (1934). The vectors of the mind. Psychol. Rev., 41, 1-35.

TSUJIOKA, B. Similarities and differences in personality structure in Japan and the
U.S.A., as shown by factoring the 16 P.F. (In Press).

TUCKER, L., in Thurstone, L. L. (1938). Primary Mental Abilities. Chicago: Univ. of
Chicago Press.

TUPES, ERNEST C., and CHRISTAL, RAYMOND C. (1958). Stability of personality trait
rating factors obtained under diverse conditions. U.S.A.F., W.A.D.C. Tech. Note,
No. 58-61.

WITTENBORN, J. R. (1951). Symptom patterns in a group of mental hospital patients.
J. Consult. Psychol., 15, 290-302.


(Manuscript received 1st January, 1963) 


AN EXPERIMENT ON THE EFFECTS OF MENTAL SET 
IN THE BROOK REACTION TEST 


By D. H. HARGREAVES, A. W. HEIM and K. P. WATTS. 
(The Psychological Laboratory, Cambridge) 


Summary. In the Brook Reaction test an attempt is made to assess the sub- 
ject's interests. Words potentially denoting one of six selected interests 
were, in this experiment, presented consecutively to an experimental group, 
and in randomised order to a control group. Some degree of mental set was 
induced in the experimental subjects, reaching significance level in the case of 
one interest only. 


I.—INTRODUCTION. 


IN a preliminary note on the Brook Reaction test, Brook and Heim (1960) 
confirmed the hypothesis that people's individual interests may be revealed by 
their responses to an ambiguous word association test. They used 106 orally 
presented stimulus-words each of which had two or more meanings and at least 
one of these meanings related to one of several specific interests. The subjects 
were thirty Cambridge university students; the non-test criterion consisted of 
interview ratings; and significant (tau) correlations between the two criteria 
were obtained for seven of nine interests. 

In the original experiment, the stimulus-words were presented in random 
order, words potentially denoting similar interests being generally separated 
from one another. Half of the subjects took the test words in the order 1-106 
(order A) and the other half took the test words in the reverse order (B). No 
significant differences were found as a result of these two orders. 

It was thought interesting, however, to see whether a mental set could be 
deliberately induced in the subject, by presenting several similar interest words 
consecutively ; to ascertain how many successive words of similar interest- 
category would be necessary to effect such a set ; and to determine the extent 
of individual variation with regard to the set. We were interested too in possible 
interest-differences between arts men and scientists. With these aims in view, 
the present experiment was designed. 


II.—EXPERIMENTAL DESIGN. 


The subjects consisted of forty Cambridge university undergraduates. 
Twenty were arts men, reading English, law, modern languages, theology or 
history, and twenty were scientists, reading natural or mechanical sciences or 
mathematics. These forty were divided into two groups, as follows: 


TABLE 1 

            20 Control                     20 Experimental 

     10 Arts      10 Science          10 Arts      10 Science 

    5A    5B      5A    5B           5A    5B      5A    5B 


Thus, both the Control and the Experimental groups contained ten arts 
men and ten scientists, half of whom took the words in one order and the 
other half, in the reverse order. All the subjects were male and aged 18-23 
years. 
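
The allocation in Table 1 can be sketched as a fully crossed design. The following is only an illustration: the names `GROUPS`, `FACULTIES`, `ORDERS` and `build_design` are hypothetical, and the sketch does not represent how the authors actually assigned individual subjects to cells.

```python
from itertools import product

GROUPS = ("Control", "Experimental")
FACULTIES = ("Arts", "Science")
ORDERS = ("A", "B")
PER_CELL = 5  # five subjects in each group x faculty x order cell (Table 1)


def build_design():
    """Return one (group, faculty, order) tuple per subject."""
    return [cell
            for cell in product(GROUPS, FACULTIES, ORDERS)
            for _ in range(PER_CELL)]


design = build_design()
print(len(design))  # total number of subjects in the design
```

Eight cells of five subjects give the forty undergraduates described above, with each group split evenly by faculty and by order of presentation.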

They were told that the experiment was on reaction times and were asked 
to give the first word or phrase that entered their mind immediately after hearing 
each test-word. A stop-watch was in evidence. As in the previous experiment, the 
ambiguity of the test-words was not recognised by the majority of the subjects. 

Since the original experiment the number of interest-categories had been 
increased from nine to eighteen. For the Experimental group, six of these were 
chosen: entertainment, food and drink, intellectual, practical, religion and sport. 
The stimulus-words were then grouped, classifying any that could be so grouped 
under one of these six heads. This left a total of thirteen ‘neutral’ words, 
that is, words connoting, for instance, aesthetic, political or business interests 
(categories which differed from the six interests chosen for the ‘mental set’ 
investigation). Six (or seven) of these neutral words occurred at the beginning, 
and seven (or six) occurred at the end of the test, according to whether order A or 
order B was used. For the Experimental A subjects, the order was as follows : 

neutral words (e.g., ‘labour’) 
entertainment words (e.g., ‘cast’) 
a linking word (e.g., ‘character’) 
intellectual words (e.g., ‘books’) 
a linking word (e.g., ‘style’) 
sport words (e.g., ‘match’) 
a linking word (e.g., ‘pole’) 
practical words (e.g., ‘lover’) 
a linking word (e.g., ‘flour’ (or ‘flower’)) 
food and drink words (e.g., ‘bacon’) 
a linking word (e.g., ‘service’) 
religious words (e.g., ‘minister’) 
neutral words (e.g., ‘sale’ (or ‘sail’)). 

The B order was the exact reverse. It may be seen that the linking words have 
possible associations with the preceding and succeeding ' set ' groups of words. 
For example, the stimulus-word ‘service’ occurs between the food and drink 
and the religion groups. The response ' restaurant’ would be taken as a food 
and drink association whilst the response ‘ church ' would be taken as a religious 
association. Similarly, with the linking word ‘character’: the response 
‘play’ would be classed as entertainment whereas the response ‘mind’ is 
classed as intellectual. 
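
The scoring of linking words described above can be illustrated with a small lookup. This is a sketch only: `LINKING_KEY` and `classify` are hypothetical names, and the key holds just the four stimulus-response pairs quoted in the text, not the full scoring scheme of the test.

```python
# Scoring key limited to the examples given in the text.
LINKING_KEY = {
    "service": {"restaurant": "food and drink", "church": "religion"},
    "character": {"play": "entertainment", "mind": "intellectual"},
}


def classify(stimulus, response):
    """Return the interest category of a response, or None if it is unscored."""
    return LINKING_KEY.get(stimulus, {}).get(response.strip().lower())


print(classify("service", "church"))
print(classify("character", "play"))
```

A response that is not in the key is returned as `None`, which stands in for an association that carries no score for either of the neighbouring interests.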

After having taken the test, each subject completed a seven-point rating 
scale on himself with regard to the relevant interests. In all but five cases a 
second interests-rating was obtained on each subject, by a person who knew him 
well. Good agreement was found between the pairs of ratings, a difference of 
more than one point being extremely rare. 


III.—RESULTS. 
A—The grouping together of stimulus-words with similar interests. 
TABLE 2 

Interest Category | Frequency: Control | Frequency: Exptal. | χ² | p | Mean rating: Control | Mean rating: Exptal. 

[Table entries are not legible in the source.] 

Table 2 shows the results of our attempt to induce a mental set, by arranging 
the stimulus-words in six concentrated groups. The second and third columns 
show the frequency of interest responses, given by the Control and the Experi- 
mental subjects, respectively. It may be seen that in every case, the frequency 
of the Experimental subjects exceeds that of the Control subjects. This differ- 
ence always is in the expected direction, but in only one case, that of food and 
drink, is the difference statistically significant. This ‘set’ group included such 
words as ‘roll’ (‘role’), ‘date,’ ‘ mint,’ and ‘ palate’ (‘palette’). The values of 
χ² are given in column 4 and those of p in column 5. 

The pair of columns on the extreme right show the mean rating on each 
of the ‘set’ interest categories for, respectively, the Control and the Experi- 
mental subjects. It is thought worthwhile to give this since it was just possible 
that the Control/Experimental differences in columns 2 and 3 were due to 
genuine differences in the interests of the two groups of subjects. The ratings 
make it clear, however, that the finding is, in fact, a result of the induced ' set,' 
the biggest rating difference being in favour of the Control group (entertainment) 
as opposed to the Experimental group, and two of the other interest-categories 
(intellectual and food) yielding identical means. None of the differences between 
mean ratings approaches significance level. 

A contingency table for all the ‘set’ groups of stimulus-words taken 
together (twelve degrees of freedom) yielded a χ² of 20.504; p = 0.058. This 
borderline level of significance for the Experimental testing as a 
whole suggests that there is an overall effect of ‘set,’ although the effect was 
insignificant for five of the six categories considered separately. 
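
As a check on the arithmetic, the probability attached to a χ² of 20.504 with twelve degrees of freedom can be recomputed. The sketch below uses the closed-form upper tail that holds for an even number of degrees of freedom; the function name `chi2_sf_even_df` is hypothetical and the code is an illustration, not the authors' procedure.

```python
import math


def chi2_sf_even_df(x, df):
    """Upper-tail probability of chi-square for a positive even df.

    For df = 2k: P(X > x) = exp(-x/2) * sum_{i<k} (x/2)**i / i!
    """
    if df <= 0 or df % 2:
        raise ValueError("this closed form needs a positive even df")
    half = x / 2.0
    terms = sum(half ** i / math.factorial(i) for i in range(df // 2))
    return math.exp(-half) * terms


p = chi2_sf_even_df(20.504, 12)
print(round(p, 3))
```

The value falls just above the conventional 0.05 level, in line with the borderline result reported.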


B—The linking words. 

It was hypothesised that if the A versus B presentation of the stimulus- 
words had any effect, this would appear in the responses to the linking-words : 
that the linking word—and perhaps one or two of the stimulus-words on either 
side of it (which were also carefully chosen)—might show the effect of the ‘ set’ 
group that was taken immediately before. In seven cases out of ten this 
tendency was found and in one case only was the opposite tendency clearly 
shown. 

An example may help to clarify this. The linking word ‘ flour ’ (or ‘ flower ’) 
occurs between the practical and food and drink ‘set’ groups. ‘Flour’ is 
flanked by ' pick-up ' and ‘ joint,’ these two words being, respectively, the last 
of the practical stimulus-words and the first of the food and drink stimulus words 
in order A. Subjects taking the test in this order produced from these three 
stimulus words a total of seven practical responses and two food and drink 
responses. Those taking the test in order B produced only three practical 
responses but eighteen food and drink responses. 
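
The size of this order effect is easier to see as proportions. The sketch below simply restates the four counts given above; `counts` and `share` are hypothetical names, and no significance test is implied, since responses to the three words come from the same subjects.

```python
# Scored responses to 'pick-up', 'flour' and 'joint', as reported in the text.
counts = {
    "A": {"practical": 7, "food and drink": 2},
    "B": {"practical": 3, "food and drink": 18},
}


def share(order, category):
    """Proportion of the scored responses in `category` for one order."""
    total = sum(counts[order].values())
    return counts[order][category] / total


for order in ("A", "B"):
    print(order, round(share(order, "practical"), 2),
          round(share(order, "food and drink"), 2))
```

In each order the larger share goes to the interest whose ‘set’ group immediately preceded the three words.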


C—Arts/Science differences. 

The differences between the arts men and the scientists on (i) the test 
and (ii) the ratings, were inspected. For this purpose, both the Control and 
the Experimental subjects were included, since the groups were equal in 
number and contained equal numbers of arts men and scientists. The four 
interests which yielded the greatest differences were practical, scientific, clothing 
and people. The scientists displayed greater interest in the former two, and the 
arts men in the latter two. In all four cases, the test scores and the ratings 
agreed with one another—very strikingly in the case of the practical and 
scientific categories. 


IV.—DISCUSSION. 

This is a small-scale experiment designed primarily to see whether ' set ' 
is likely to arise, and to have a deleterious effect, in the Brook Reaction test as 
normally given, that is, with the word order randomised with relation to poten- 
tial interest value. From this point of view, the experimental results are 
satisfactory : when words connoting similar interests were given consecutively, 
for six interest-categories, the effect of ‘ set ' reached significance level in only 
one instance. It is true that for the other five interests, some ' set' effects were 
discernible, but these were far from significant. When the results are summated, 
the influence of 'set' on the Experimental group is of borderline significance 
only. Furthermore, at least three or four consecutive words of potentially 
similar interest value were necessary before a ‘set’ was effected. The danger, 
therefore, of inadvertently introducing a 'set' in the Brook Reaction test 
proper (as taken by the Control group), which might obscure the relationship 
between stimulus-words and genuine interests is negligible. 

There was a certain amount of individual variation among subjects, both 
in the Experimental and the Control group. For example, one subject in the 
Experimental group gave the identical response to each of the following 
(consecutive) stimulus-words: ‘box,’ ‘race,’ ‘bowl.’ The response in each 
case was ‘horse’ and the subject commented later that he had been unable to 
rid his mind of the idea ‘horse’! These three stimulus-words appeared towards 
the middle of the sport ‘set’ group for the Experimental subjects. 

On the other hand, a Control subject gave four out of five science responses 
to what purported to be a mixed group of five stimulus-words. The stimuli and 
responses were as follows: 


Stimulus-word Response 
construction diagram 
punch drunk 
arc circle 
joint Fisher (geology) 
plot curve 


Such findings were rare, however, and probably tell us more about the 
individual subject than they do about the test. The results in general suggest 
that (i) a misleading mental set is unlikely to be inadvertently introduced, 
when the words are presented as to the Control subjects, in random order and 
(ii) that interest-differences which obtain between arts men and scientists, as 
such, are reflected at least as strongly in the test score as they are in the ratings. 


THE DEVELOPMENT OF SCIENTIFIC CONCEPTS IN 
CHILDREN (2) 


By W. H. KING 
(University of London Institute of Education) 


SUMMARY. This paper describes the responses of children to twenty science 
questions and relates these responses to age and ability represented by scores 
on a verbal and non-verbal test. A relationship is established for certain types 
of question between their facility values and the grades obtained by the 
children on the criteria. Rate of growth of scientific knowledge is found to be 
similar for boys and girls ; boys and girls of 10 years of age show no significant 
differences in knowledge of science concepts but there is significant 
differentiation in secondary school children. 


I.—INTRODUCTION. 


A PREVIOUS paper in this Journal (Vol. XXXI, p. 1, 1961) discussed the results 
of an investigation dealing with the growth of children's scientific knowledge 
with increase of chronological age. The replies to seventy questions were 
examined and eighteen of these showed quite clearly a steady increase of correct 
answers with increase in age (loc. cit., Appendix 2). These eighteen questions 
with two more drawn from the original list were used as a basis for further 
study. There was no evidence from the previous research that any of these 
questions were unduly favourable to either sex. The form of presentation of 
these questions was decided at a discussion with the group of teachers who 
had volunteered to carry out this research in their schools. 


II.—THE INVESTIGATION. 


(1) Material. The twenty questions in the Science Schedule were produced 
on a standard form on which the replies could be recorded objectively by the 
children. All the children were given the Simplex Junior Intelligence Scale and 
the Raven Progressive Matrices (Revised Order, 1956) so that for each child 
the following information was available : 

(a) chronological age, 
(b) raw score on verbal test, 
(c) raw score on non-verbal test, 
and (d) responses to each of the twenty questions of the Science Schedule. 


(2) Subjects. Teachers in twenty-six schools offered to co-operate in this 
research. This would have involved the testing of approximately 4,800 children. 
To reduce the size of the experiment to manageable proportions, the complete 
battery of test material was given to every fifth or sixth child (depending on 
size of class) on each class register. By this arrangement, 801 children (368 
boys and 433 girls) were tested ; they formed a random group from the partici- 
pating schools even though the schools themselves were not randomly selected. 
The oldest was 17 years 3 months and the youngest 5 years 4 months. This 
wide age-range was a decisive factor in the selection of suitable published tests. 
Of the twenty-six schools, seventeen were mixed primary schools, one infants' 
school, two secondary modern girls' schools, two secondary modern boys' 
schools, one mixed secondary modern school, one girls’ grammar school, one 
girls' comprehensive school and one mixed independent school. Twenty of these 
schools were in the Greater London area and six in the Home Counties ; 528 
children were at the primary stage and 273 at the secondary stage. 
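
Taking every fifth or sixth name from a class register is a form of systematic sampling. A minimal sketch follows; the function name `every_kth`, the starting position and the example register are all hypothetical, since the text does not say how the first child in each class was chosen.

```python
def every_kth(register, k, start=0):
    """Return every k-th entry of a class register, beginning at `start`."""
    if k < 1:
        raise ValueError("k must be at least 1")
    return register[start::k]


# Hypothetical register of 30 children, sampled at every fifth name.
register = [f"child_{i:02d}" for i in range(1, 31)]
sample = every_kth(register, 5)
print(sample)
```

With roughly 4,800 children and a step of five or six, a sample of about 800 to 960 would be expected, which agrees with the 801 children actually tested.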

Table 1 shows the age distribution according to sex and type of school. 


The seventeen children in the infants’ school and the eleven boys and the eleven 
girls in the one independent school who were under 11 years old were grouped 
under the primary heading. The forty-two girls in the comprehensive school 
were following secondary modern courses and were, therefore, included in the 
secondary modern figures. 


The smaller number of boys in the grammar school group (thirty-five boys 
compared with seventy-two girls) made the sample less representative than was 
desired. 


Table 2 summarises the nature of the sample by sex and stage of education. 


TABLE 2 








III.—THE RESULTS. 


The means and standard deviations on the four variables for all the children 
and for boys and girls taken separately are given in Table 3. 


TABLE 3 


Mean Scores Standard Deviations 


Simplex | Matrices| Science | Age in Science 
(100) (60) |Schedule| months | Simplex | Matrices| Schedule 


53.5 | 35.5 | 26.1 | 21.5 | 11.5 | 3.3 
59.7 | 38.5 | 26.8 | 20.9 | 12.0 | 3.2 
57.1 | 37.1 | 26.8 | 21.4 | 11.9 | 3.3 





The average age of the girls exceeded that of the boys by eight months and 
this difference was significant at the .01 level. The differences in the means of 
the Simplex scores and of the Matrices scores were also significant (.01 level). On the 
Science Schedule, considered as a test, the boys’ scores were just significantly 
higher (C.R. = 1.96). 
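
A critical ratio of 1.96 sits exactly on the conventional 5 per cent. boundary when it is referred to the normal curve. The sketch below shows the conversion; it assumes a two-sided test, which the text does not state, and `two_sided_p` is a hypothetical name.

```python
import math


def two_sided_p(critical_ratio):
    """Two-sided normal-curve probability for a critical ratio (z value)."""
    return math.erfc(abs(critical_ratio) / math.sqrt(2.0))


p = two_sided_p(1.96)
print(round(p, 3))
```

This is why the difference is described as "just" significant: any smaller ratio would have fallen short of the .05 level.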

The intercorrelations of the four variables were calculated separately for 
boys and girls and since there were no significant differences, Fisher’s Z-technique 
was used to calculate the correlations on the complete 801 sets of results which 
are recorded in Table 4. 
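
One common way of combining boys' and girls' coefficients by Fisher's z is to average the transformed values with weights n − 3 and transform back. The sketch below follows that convention as an assumption; the exact weighting used in the study is not stated, `pool_correlations` is a hypothetical name, and the two coefficients in the example are invented for illustration.

```python
import math


def pool_correlations(pairs):
    """Pool correlations given as (r, n) pairs via Fisher's z transformation.

    Each r is converted to z = atanh(r), the z values are averaged with
    weights n - 3, and the mean is converted back with tanh.
    """
    pairs = list(pairs)
    weighted = sum((n - 3) * math.atanh(r) for r, n in pairs)
    weights = sum(n - 3 for _, n in pairs)
    return math.tanh(weighted / weights)


# Sample sizes from the text (368 boys, 433 girls); the r values are invented.
pooled = pool_correlations([(0.75, 368), (0.78, 433)])
print(round(pooled, 3))
```

The pooled value always lies between the two separate coefficients and is pulled towards the one based on the larger group.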

TABLE 4 

[Intercorrelations of the four variables. Only the column headings ‘Matrices’ and ‘Science Concepts’ and the entries 0.570 and 0.767 are legible in the source.] 





These high correlations might be expected because of the wide range of 
age and ability of the children. The Simplex Junior is the best predictor of total 
Science Scores and is significantly better (.01 level) than either chronological 
age or Matrices; Matrices in turn are a better predictor than age (significant 
at .05 level). 

The largest group of children tested in a single year group was those whose 
ages were in the range 10 to 10: 11; in all there were seventy-six boys and 
sixty-six girls from seventeen junior schools. The three boys and five girls from 
the independent school were not included. The separate means and standard 
deviations for these boys and girls (Table 5) on the Matrices, Simplex Junior 
and the Science Schedule did not differ significantly, neither did the inter- 
correlation of these variables (Table 6). 


TABLE 5 


MEANS AND STANDARD DEVIATIONS OF THE THREE VARIABLES FOR CHILDREN, AGES 10:0 
TO 10:11. 


Mean Scores Standard Deviations 


Science Science 
Simplex | Matrices | Schedule | Simplex | Matrices | Schedule 





38.0 | 12.2 | 13.3 | 9.0 | 2.4 
11.6 | 14.1 | 9.3 | 2.6 





TABLE 6 


INTERCORRELATION OF THREE VARIABLES FOR CHILDREN, AGES 10 TO 10:11 


Comparing these results with those obtained from the extended age-range, 
the fall in values of the intercorrelations is to be expected ; although the 10-year- 
old girls still obtain higher mean scores on the Matrices and Simplex Junior and 
the boys higher mean scores on the Science Schedule, these differences are no 
longer significant. This suggests that had there been a more representative 
sample of grammar school children, there might have been a narrowing of the 
difference in means on the Matrices, but a widening of the mean scores on the 
Science Schedule. 

Dr. Gertrude Keir reported correlations of 0.538 for boys and 0.588 for girls 
between the Simplex Junior and Matrices for a population of primary and 
secondary modern children using the 1938 version of the Progressive Matrices. 
The higher correlations now obtained in the present study may be due to the 
improved version of this test. 

With age partialled out, there were no significant differences between boys’ 
and girls’ correlations on the three remaining variables. For the pooled scores, 
the partial correlation between Simplex Junior and the Matrices was 0.629, 
compared with 0.64 for a single year group; similarly, the correlations 
between Matrices and the Science Schedule were 0.403 for the pooled scores and 
0.34 for the single year group. 
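
Partialling age out of a correlation uses the standard first-order formula, r(xy.z) = (r(xy) − r(xz) r(yz)) / √((1 − r(xz)²)(1 − r(yz)²)). The sketch below applies it; `partial_r` is a hypothetical name, and because the correlations with age are not legible in Table 4 the two age values in the example are invented, so the printed figure is not a reconstruction of the reported 0.629.

```python
import math


def partial_r(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    denominator = math.sqrt((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2))
    return (r_xy - r_xz * r_yz) / denominator


# 0.767 is the reported Simplex-Matrices correlation; 0.6 and 0.6 are
# invented stand-ins for the correlations of each test with age.
print(round(partial_r(0.767, 0.6, 0.6), 3))
```

When neither variable is related to the one partialled out, the formula leaves the original correlation unchanged.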


Allowing for the fact that the scores in the single year group are still in- 
fluenced by age, the corresponding correlations of the scores of the children in 
the complete age-range, when age is partialled out, do not differ unduly from the 
correlations obtained in the 10 to 10: 11 group. The correlation between the 
Simplex Junior and the Science Schedule, with age partialled out, was .527. 
Again, the verbal test was a better predictor of science scores (significant at 
.05 level). Further partialling out of variables yields figures which are less 
reliable but the results are recorded as they show the relative influences of verbal 
and non-verbal scores on the Science scores. 


TABLE 7 





The boys’ correlations differ significantly (.05 level) and the girls’ 
correlations also (.01), but the differences between the boys’ and girls’ 
correlations are not significant. 

As a test, the Science Schedule has a reliability of 0.604. It was slightly 
more reliable for boys (0.60) than for girls (0.55). The corresponding 
reliabilities of the Simplex Junior and the Matrices, obtained by the same 
method, were 0.96 and 0.91* respectively. 

The lowest correlation (0.504) is between age and science scores. The 
effect of age on the science scores was shown graphically in the previous research 
(loc. cit.). 

* loc. cit., p. 143. In a discussion of the reliability of the Matrices (1938), Keir reports 
a reliability coefficient of 0.76 (split half method) and 0.64 for successive trials on a small 
sample of children. 

The mean scores of the two sexes on the Simplex Junior differed 
significantly for the full age-range but not for the 10 to 10:11 
children. It has been shown that the girls did significantly better than the boys 
on the Matrices for the complete age-range (.01 level) but this difference was 
not significant for the 10-year-old group. This may again be evidence of the 
under-representation of grammar school boys in the sample; if so, in a fully 
representative sample the boys’ scores on the Science Schedule 
would have been more than just significantly higher (C.R. = 1.96) than the girls’. 
Nevertheless, there was no significant sex difference in the science, matrix or 
verbal test scores (means and standard deviations) for the 10-year-old group. 
Such differences as have been found in the sample of older children may be the 
results of differing types of schooling, and a further study of the groups of children 
in different types of school in the age-range 13 to 15 years might supply a more 
positive answer. 

The multiple regression equation to predict science scores confirmed the 
influences of the other three variables—verbal, non-verbal and age. This 
predictive equation in score form is 
X(science) = 0.82 X(Simplex) + 0.034 X(Matrices) + 0.009 X(age in months) + 4.07 
giving a multiple correlation coefficient of 0.684. Approximately 47 per cent. 
of the variance of the Science test is attributable to the joint action of age, 
verbal and non-verbal variables; 36 per cent. and 7 per cent. are the 
contributions of the verbal and non-verbal tests and 3.5 per cent. that of the differences in age. 
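
The percentages quoted can be checked against the multiple correlation, since the proportion of variance accounted for is the square of R. The sketch below does only that arithmetic; the variable names are hypothetical and all figures are those given in the text.

```python
R = 0.684  # multiple correlation coefficient reported in the text

# Separate contributions to the variance of the Science test, in per cent.
contributions = {"verbal": 36.0, "non-verbal": 7.0, "age": 3.5}

explained = 100.0 * R ** 2           # variance accounted for, in per cent.
total = sum(contributions.values())  # sum of the three stated contributions

print(round(explained, 1), total)
```

The square of 0.684 and the sum of the three contributions agree to within half a percentage point, both rounding to the "approximately 47 per cent." stated.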

There were too few children in the 5-year-old and the 16- and 17-year-old groups, 
so these children were included in the 6-year and 15-year groups, 
respectively. Twenty graphs of the percentage correct response for ages 6 to 
15 were superimposed on the corresponding graphs for the same twenty questions 
obtained in the previous study of primary school children. The two sets of 
graphs were similar for the primary school children; such differences as 
existed might be attributed to: (a) differences in sampling, (b) the standardising 
of the form of the answers in the second study and (c) fewer questions in each 
section, which cut down the chances of practice. 

So far, the total scores of the twenty science questions have been compared 
with age, Matrix scores and verbal reasoning scores for boys and girls. An 
analysis by age, verbal and non-verbal score was obtained for each of the science 
items. The raw scores were divided into five grades determined by their 
20th, 40th, 60th and 80th percentiles, separately for boys and girls. The 
corresponding end scores were calculated, checked graphically and plotted 
against correct percentage response for each science question. The graphs 
illustrated the similarity of trends in boys’ and girls’ responses. There were 
slight differences between boys and girls that might be attributed partly to 
sampling errors, but every graph showed a steady increase of correct percentage 
response throughout the grades E to A of the verbal and non-verbal tests. 
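
The division of raw scores into five grades at the 20th, 40th, 60th and 80th percentiles can be sketched as follows. The function name `grade_scores` is hypothetical, the interpolation rule is that of Python's `statistics.quantiles` rather than whatever graphical method was used in the study, and the scores in the example are invented.

```python
import bisect
import statistics


def grade_scores(scores):
    """Grade each raw score E (lowest fifth) to A (highest fifth)."""
    # Four cut points: the 20th, 40th, 60th and 80th percentiles.
    cuts = statistics.quantiles(scores, n=5)
    return ["EDCBA"[bisect.bisect_right(cuts, s)] for s in scores]


# Invented raw scores 1..100, for illustration only.
scores = list(range(1, 101))
grades = grade_scores(scores)
print(grades[0], grades[49], grades[-1])
```

Applying the function separately to the boys' and the girls' scores gives each sex its own cut points, which is what equates the two distributions before the graphs are compared.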

Direct comparisons were possible between percentage correct replies on 
each of the science questions and the ages and grades on the Matrices and Sim- 
plex Junior tests. From the eighty graphs comparisons were made between 
question difficulty (as measured by percentage correct replies) and age, verbal 
grade and non-verbal grade. For example, in the science question (No. 15. Are 
all things that move living?) at least 60 per cent. of the answers were correct 
at a minimum age of 9 years ; also with this 60 per cent. facility value, boys and 
girls had to be graded D at least on the Simplex Junior, and on the Matrices, 
boys required a C grading and the girls a D grading at least. The choice of a 
facility value of 60 per cent. was quite arbitrary, but seemed a reasonable figure 
to take for purposes of comparison. The ages and grades are given in Table 8. 
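
Reading off the minimum age (or grade) at which a question reaches the 60 per cent. facility value amounts to finding the first point on the graph at or above that level. A sketch follows; `minimum_level` is a hypothetical name and the percentages in the example are invented, chosen only so that the threshold is first met at 9 years as in the Question 15 example.

```python
def minimum_level(percent_correct_by_level, threshold=60.0):
    """Return the first level whose percentage correct reaches the threshold.

    `percent_correct_by_level` is a sequence of (level, percentage) pairs
    in ascending order of level; None is returned if no level qualifies.
    """
    for level, percent in percent_correct_by_level:
        if percent >= threshold:
            return level
    return None


# Invented percentages correct by age in years, for illustration only.
by_age = [(6, 20.0), (7, 31.0), (8, 48.0), (9, 62.0), (10, 70.0)]
print(minimum_level(by_age))
```

Lowering the `threshold` argument reproduces the point made in the text that a lower facility value brings down the corresponding ages and grades.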


TABLE 8 


PATTERN OF GRADES ON VERBAL AND NON-VERBAL TESTS WITH AGE CORRESPONDING TO 
60 PER CENT. FACILITY VALUE ON EACH SCIENCE ITEM. 




















Questions 1, 5, 7, 18, 19 and 20 were too difficult and the only guide for this 
level of accuracy that can be deduced is that for Questions 1, 5 and 18 the age 
should be at least the age recorded, i.e., 14, 14 and 15, respectively. Less than 
60 per cent. of the children, whatever their age or grade, gave correct replies 
to Questions 17, 19 and 20. These questions were too difficult. A lowering of the 
facility value will reduce the corresponding ages and verbal and non-verbal 
grades for each science concept. 


Question 13, dealing with the relative speed in a train of cogs, brings out 
the differences in the replies of boys and girls. To obtain 60 per cent. correct 
answers to this question, the children should be at least 11 years old; the boys 
are likely to be a point above the fortieth percentile in the verbal test, girls 
above the sixtieth percentile; and the boys over the sixtieth percentile in the 
non-verbal test and the girls at least at the eightieth percentile. 


IV.—DISCUSSION. 


The correlation between the scores of the verbal and non-verbal tests 
(0.767) was the highest correlation recorded. This may be due to the wide 
range of age and ability of the children as well as the common factorial element 
of the two tests. There was some evidence that the verbal test was a better 
predictor of success in the science questions than the non-verbal test. The 
difference of the correlation coefficients was highly significant statistically. 
Further evidence was provided by the relative proportions of the variance of the 
science test attributable to the verbal and non-verbal scores. The predictive 
powers of the three variables are in the order (a) Simplex Junior test, (b) Raven's 
Progressive Matrices and (c) age. The science test was given in verbal form and 
it would seem, therefore, that a scholastic test with a large g : v component might 
a priori be a better predictor of scientific attainment in the present study than a 
test which has a large ‘g’ loading. 

Boys were consistently better than girls on both the verbal and non-verbal 
criteria in questions relating to (a) shadow of post; (b) weights on see-saw; 
(c) rotation of cog-wheels; (d) shadows formed by edge of cardboard circle. 
These questions were not intrinsically more difficult than others where these 
differences were not apparent; for example, there were no differences in the 
pattern of responses to questions 18, 19 and 20 which candidates of whatever 
age and ability found difficult. These three questions demanded imagination 
of the shape of the cut surface but the other four types of question required the 
application of mechanical principles which for most children may not have 
formed part of formal science lessons. If this is so, it seems likely that boys 
tended to draw more on practical out of school experience. 


The differences in the sample of boys and girls who took part in this study 
have already been noted. The girls were older and did better on the Matrices and 
Simplex Junior, whereas the boys obtained significantly higher scores on the Science 
questions; with the 10 to 10:11 group, however, there were no significant differences 
in any of the variables. It would appear that the boys’ superiority on the science 
questions does not come about until after the children have left the primary 
school. 


When the percentile grades were determined separately for boys and girls 
with the intention of equating the two distributions, the graphs of percentage 
correct response, first against the verbal grades and then against the non-verbal 
grades, were nearly identical for the two sexes. Allowing for differences 
in the boys’ and girls’ samples, the rate of growth of correct science replies 
relative to achievement on the verbal and non-verbal tests was therefore almost 
identical. From this it might be deduced tentatively that differences in the 
attainments of boys and girls in science, e.g., at ' O' Level, may not be due so 
much to differences in verbal and non-verbal ability, but to other non-academic 
influences, e.g., teaching, interest motivation, self-involvement, etc., with which 
this study was not concerned. 


VI.—APPENDIX 1 (A). 


SCIENCE SCHEDULE. PERCENTAGE OF CORRECT SCIENCE SCORES. 


















Junior Simplex 
Percentiles. 











16 | 17 | 18 | 19 | 20 


77 | 58 | 29 | 25 | 27 
88 | 77 | 18 | 60 | 27 | 25 
















87 | 70 | 51 | 26 | 17 | 17 
80 | 67 | 15 | 42 | 19 | 11 
87 | 83 | 99 | 88 | 72 | 87 | 87 
89 | 78 | 92 | 73 | 55 | 73 | 72 
87 | 84 | 95 | 86 | 61 | 81 | 71 
78 | 77 | 91 | 88 | 48 | 66 | 52 


79 | 73 | 89 | 77 | 49 
78 | 83 | 82 | 76 | 41 


76 | 62 | 55 | 88 | 40 | 49 
56 | 92 | 68 | 58 | 26 | 30 | 37 





























VII.—APPENDIX 1 (B). 


SCIENCE SCHEDULE. PERCENTAGE OF CORRECT SCIENCE SCORES. 

Percentiles were calculated from boys’ and girls’ combined distributions. 


VIII.—APPENDIX 2. 

Estimate the time in seconds between two taps. (1) 


A jar is partly filled with water. A stone is gently dropped 
into the water. Is the level of the water now higher, 


lower, or the same as it was before ? (2) 
Look at the diagrams. [Diagrams not reproduced; levels marked A, B and C.] 


The first represents a glass of water. All the water is poured 
into the second glass. If the bottoms of both glasses are 
the same size, will the new level of the water be at 
A, B or C? (3) 


Imagine two pieces of plasticine which have the same size 
and weight. One lump is rolled into the shape of a 
pencil. Will the two pieces now have the 
(a) same weight? (yes, no?) (4) 
(b) same volume? (yes, no?) (5) 


Ice turns into water, sugar dissolves in water. 


Does steam turn into water? (yes, no?) (6) 
Does salt turn into water? (yes, no?) (7) 
Does hail turn into water? (yes, no?) (8) 
Do clouds turn into water? (yes, no?) (9) 


[Drawing with lines marked A, B and C.] 
The drawing shows a man standing in a road in the light of a 
lamp post. Will the man’s shadow fall along the lines 
A, B or C? (10) 
[Diagram of a seesaw with positions marked A, B and C.] 


Here is a diagram of a seesaw with a girl sitting at one end (C). 
If a boy is twice as heavy as the girl, where would he sit 
to balance the seesaw—at A, B or C? (11) 


Here are three cog wheels, A, B, C. 
Which cog wheel turns in the same direction as A? Is it 
B, C or neither? (12) 

Which cog wheel turns the fastest? Is it A, B or C? (13) 

Is a flower alive ? (yes, no, don't know) (14) 
Are all things that move living ? (yes, no, don’t know) (15) 


A needle is stuck upright in a cork. The cork floats on water. 
Draw two lines showing position of cork and needle. (16) 


A circle cut from cardboard is held edge on to an electric 
light. The shadow of this cardboard falls on a screen. 
Draw the shadow you would see on the screen. (17) 


A rubber ball is cut through the middle. Draw the shape 
made by the cut. (18) 


The top of a cone is cut off as in the diagram. 
Draw the shape made by the cut. (19) 
A cone is cut in half, as shown in the diagram. 
Draw the shape made by the cut. (20) 
LEADERSHIP IN A DUAL CULTURAL SETTING: A 
SOCIOMETRIC STUDY OF CLEAVAGE BETWEEN 
ENGLISH AND AFRIKAANS-SPEAKING SCHOOL CHILD- 
REN AND THE ROLE OF THE LEADERS IN BRIDGING IT 


By R. K. MUIR 
(Education Department, University of the Witwatersrand) 


SUMMARY. Moreno’s sociometric test technique was used to measure cleavage 
in the school-leaving classes of four parallel-medium high schools. On the basis 
of an analysis using sociograms and statistical tests, the two culture groups in 
the schools were found to co-operate in the environmental organisation of the 
schools, although not many friendships were formed between the groups. The 
leaders were found to initiate contact in the environmental organisation of the 
schools and it is suggested that this contact leads to friendship ties. 


I.—INTRODUCTION. 
SOUTH AFRICA is faced with several peculiar problems in the field of human 
relations. Not only is there the problem of Black and White races, but the 
White race itself is composed of two groups which have different cultures. The 
most marked outward manifestation of the two cultures is seen in the two 
languages, English and Afrikaans, spoken by the groups. In the schools the 
pupils of the two White groups have, by and large, been kept apart, and some 
educationalists, politicians, and indeed people from all ranks of society, have 
felt that by putting pupils from the two groups together to be taught in the 
same school, one step will have been taken towards unifying the White popula- 
tion. The assumption is that a more intimate knowledge of the members of the 
other group will lead to a liking for the group. Research has shown that while 
this is often the case, increased knowledge of another group can lead to antipathy 
and even stronger feelings of identity for one's own and rejection of the other 
(Herman and Schild 1961, Sherif and Sherif 1953). In any society in which there 
are sharply defined differences between groups composing the society, as in 
South Africa, there will have to be liking born of more immediate contact 
between the groups if harmony is to be achieved. 

Apart from separate educational provision for the two cultures, dual-medium 
and parallel-medium schools have been tried. The former suffer from 
the waste of time in teaching and learning resulting from using two languages in 
the same classroom, and the critics of the latter say that there is no contact 
between the groups anyway, so that these schools too are a waste of time. This 
study was designed to examine the measure of cleavage in the structure of English 
and Afrikaans-speaking pupils who attend parallel-medium schools. Further, the 
aim was to examine the role of the leaders in this dual cultural setting. 


II.—SOCIOMETRIC TEST TECHNIQUE AND GROUP STRUCTURE. 

A modified version of Moreno's sociometric test technique was used 
(Moreno 1953). The test made it possible for members of different cultural 
groups to choose each other, so cleavage between the cultures could be measured 
by the choice pattern between them. A choice expressed by a member of one 
culture group for a member of another has been called a cross-cultural choice. 
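
As a purely illustrative sketch of this definition, cross-cultural choices might be picked out of a record of choices as follows. The pupil labels, the `culture` mapping and the function name are assumptions made for the example and do not come from the original test material.

```python
def cross_cultural_choices(choices, culture):
    """Return (chooser, chosen) pairs whose members belong to different culture groups."""
    pairs = []
    for chooser, chosen_list in choices.items():
        for chosen in chosen_list:
            if culture[chooser] != culture[chosen]:
                pairs.append((chooser, chosen))
    return pairs


# Hypothetical data: two English-speaking and two Afrikaans-speaking pupils.
culture = {"E1": "English", "E2": "English", "A1": "Afrikaans", "A2": "Afrikaans"}
choices = {"E1": ["E2", "A1"], "E2": ["E1"], "A1": ["A2", "E1"], "A2": ["A1"]}

cross = cross_cultural_choices(choices, culture)
print(cross)
```

The count of such pairs, relative to all choices made, is the raw material for the measures of cleavage discussed later.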

The technique has limitations and some assumptions underlie its use. 
Sociometric status is the combined score of all the choices received by the 
individual on the criteria in the test. This objective measure is indicative of how 
the group evaluates each of its members. Those with a score of nought are 
called isolates and those with a statistically high score are called stars. Various 
interpretations may attach to star status and it can only be explained by 
reference to the social setting. The test technique merely identifies stars, but, 
generally speaking, stars are important to the group to ensure its continued 
existence and to uphold its cultural traditions. Stars act as spokesmen, initiate 
and direct action, set the values and goals to which the group aspires, and 
determine how they should be realised. Research has suggested that stars are 
better liked and tend to be highly evaluated in areas other than those which 
were the basis of their status. Further, they interact more with their fellows 
(Hurwitz, et alia, 1955). While it is important for the leaders to uphold the 
group’s way of life—and if they transgress too far from their position, they will 
lose their status—they are, nevertheless, given more latitude in their behaviour 
than are the members of lower status. They are expected to mix with other 
groups in upholding the prestige of their own, and in doing so, they interact 
with the members of those groups. 
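
The scoring described above can be sketched in a few lines. This is a minimal illustration only: the data are invented, and the `star_cutoff` parameter is an assumed stand-in for whatever statistically determined limit is used to identify stars.

```python
from collections import Counter


def sociometric_status(choices):
    """Count the choices each pupil receives; `choices` maps chooser -> list of chosen pupils."""
    received = Counter()
    for chooser, chosen_list in choices.items():
        for chosen in chosen_list:
            if chosen != chooser:  # a self-choice would not count
                received[chosen] += 1
    return {pupil: received.get(pupil, 0) for pupil in choices}


def classify(status, star_cutoff):
    """Label each pupil as isolate (score 0), star (score >= star_cutoff) or member."""
    labels = {}
    for pupil, score in status.items():
        if score == 0:
            labels[pupil] = "isolate"
        elif score >= star_cutoff:
            labels[pupil] = "star"
        else:
            labels[pupil] = "member"
    return labels


# Hypothetical choices pooled over all criteria.
choices = {
    "P1": ["P2", "P3"],
    "P2": ["P1", "P3"],
    "P3": ["P2", "P1"],
    "P4": ["P2", "P3"],
}
status = sociometric_status(choices)
labels = classify(status, star_cutoff=3)
print(status, labels)
```

In practice the cutoff would be derived from the distribution of choices in the group concerned rather than fixed in advance.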

Interaction between members is expressed in a sociometric test in choice 
patterns and cannot be defined further than the pattern. A choice made by a 
member A of another B means that A has a desire to interact with B. B has 
also made choices and should B choose A it would seem that this mutual choice 
situation means that there is interaction between A and B. There is at the least 
a desire to associate on the part of both, and this has probably already led to 
successful interaction which has reinforced the desire. 
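
A mutual choice in this sense can be detected mechanically. The sketch below assumes a simple mapping from each chooser to the pupils chosen; the names and data are hypothetical.

```python
def mutual_pairs(choices):
    """Return the set of unordered pairs {A, B} in which A chose B and B chose A."""
    pairs = set()
    for a, chosen_list in choices.items():
        for b in chosen_list:
            if a != b and a in choices.get(b, []):
                pairs.add(frozenset((a, b)))
    return pairs


# Hypothetical example: A and B choose each other; the other choices are one-way.
choices = {"A": ["B", "C"], "B": ["A"], "C": ["B"]}
pairs = mutual_pairs(choices)
print(pairs)
```

One-way choices (here A to C and C to B) express only a desire to interact and are not counted as pairs.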


The structure of the group becomes the sum total of types of choice patterns 
and the distribution of choices. When structure is considered the emphasis is 
not so much on numbers of choices but on their configurations. Why an 
individual should be accepted, why the members should choose in a particular 
pattern, or why the group should be so structured are questions which the 
sociometric technique alone cannot answer. Other methods of study would 
need to be developed to answer these questions. 


This study was of school populations in which cleavage could be expected 
between the English and Afrikaans-speaking pupils who composed the 
population, and the sociometric test has been found well suited to reveal cleavage 
in the structure of a group (Criswell 1947, Loomis 1943). However, it cannot 
explain the cleavage or say why cross-cultural choices were made in certain 
cases, although the pattern revealed is such that interesting conclusions can be 
drawn from it. 
Each member of the test population was asked to make sociometric 
choices of friends and of leaders in the school standard, every opportunity 
being given for cross-cultural choices to be made. Choosing of friends can be 
regarded as inter-personal capacity for participating in the intensely personal 
organisation of the group, which Jennings called the ‘psychegroup’ (Jennings 
1947) and Homans along similar lines called the ‘internal structure’ of the 
group (Homans 1951). Each has suggested a wider pattern of contact in the 
more formal outward organisation of the group—called by Jennings the 
‘sociogroup’ and by Homans the ‘external structure’—and leadership is a 
phenomenon of this structure. 

Although their approaches differ and the setting of their studies is different, 
the hypotheses formulated by both Jennings and Homans suggest that where 
two cultural groups impinge on one another, there will be more contact between 
them in the leadership structure of the group. Conversely, the friendship 
structure will be structured within the culture group with much less inter-group 
contact. Homans made a further hypothesis that interaction in the external 
structure will lead to interaction in the internal structure. 

It was the aim of this study to see how these hypotheses held true for groups 
whose members belonged to different cultures. These two hypotheses were 
formulated using Jennings' terminology : 

Hypothesis 1. When two groups with different cultures are brought 
together in the formal organisation of school life, leaders will be chosen 
irrespective of the group to which they belong, but friends will be chosen on an 
in-group basis. Cleavage will be marked in the psychegroup structure but not in 
the sociogroup structure. 

Hypothesis 2. The leaders in the groups of English and Afrikaans-speaking 
pupils will interact in the sociogroup structure, and this interaction will result 
in interaction in the psychegroup structure. Thus, the leaders will bridge the 
cleavage between the groups. 


III.—THE RESEARCH METHOD. 


After a preliminary trial, these four criteria were chosen for a final 
questionnaire : 


(1) With which of the pupils in your standard would you like to remain in 
contact in the years ahead ? 


(2) Which of the pupils in your standard would you most like to invite to your 
home to enjoy leisure activities ? 

(3) Many schools give an award to the pupil who has served the school best. 
Qualities of leadership and ability to put school interests before personal 
interests are important considerations for the award. Which of the pupils 
in your standard do you think worthy of the award ? 


(4) Which of the pupils in your standard do you think have shouldered or could 
shoulder the responsibilities of prefectship best ? 


The first two criteria, referred to as the friendship criteria, were of a general 
nature. By ‘ general’ is meant that a general area for activity is indicated in 
the criterion rather than a specific field of interaction ; the person is chosen for 
his personal qualities and the kind of person he is rather than for any specific 
skill of which he is capable. It was desired to elicit choices which would reflect 
the pupils’ desire to maintain a permanent pattern of group structure, so the 
first criterion was found very suitable. Both criteria were suitable for eliciting 
choices which would imply a desire to interact socially, however limited the 
social milieu of all or some of the members of the group may have been. The 
criteria were meaningful to all and did not leave any but the anti-social in an 
embarrassed position of being unable to make the choices. There was every 
opportunity in these criteria for reciprocal choice and mutual association. 

The last two criteria, referred to as the leadership criteria, were specific in 
nature. The basis for interaction was limited, in that in the choice of leaders, 
the choice process tends to favour a few individuals, and ignores a large number 
of them. The leadership criteria had meaning for all the experimental groups 
because the choice of pupils on the basis of these criteria is made annually by 
the school staff in order to present awards and prizes. The choice arouses great 
interest in the pupils and being given an opportunity to express a viewpoint 
was readily accepted. 

The two types of choice criteria reflect activity in the psychegroup and 
sociogroup structures. The friendship criteria require choices on the basis of 
personal satisfaction. The choice is made of those who have personal attraction 
for the person choosing, and there is the opportunity for the choice to be 
reciprocated. The process is two-way in that the choosers are expressing choices 
of each other, and so the reciprocal choices are revealed. The leadership criteria 
demand choices not on the basis of personal qualities of attraction for others, 
but rather on the basis of the contribution the leader can make to the group life, 
on his importance to the group and his ability to get things done. This choice 
process is one-way. All choices tend to go to the few, and the few tend to 
choose each other, so there is not the same opportunity for mutual choice. It is 
only the leaders who interact in a sociometric sense. 


The number of choices allowed on each criterion was five. School-leaving 
pupils have sufficiently developed social contacts to make this number of 
choices easily and studies have shown that five choices provide the most stable 
sociometric results (Gronlund 1959). A fixed number of choices also has practical 
advantages in statistical interpretation. Only positive choices were required, 
as it was felt that resentment would be caused by asking for rejections. 
Spaces were provided on the questionnaire for the choices to be filled in. 

A carefully-prepared Afrikaans version of the questionnaire, in which it 
was ensured that the meaning and intention of the original English version was 
retained, was available for the Afrikaans-speaking members of the experimental 
groups. 

The questionnaire was administered with both a verbal and a written set of 
instructions in both languages, care being taken to establish rapport and to 
bring the members of the two cultures together so that cross-cultural choosing 
should be possible, without making it seem desirable. The four experimental 
groups with a total of 171 subjects were distributed in sub-groups which were 
designated as follows : 


Sub-group a—English-speaking boys. 
Sub-group b—English-speaking girls. 
Sub-group c—Afrikaans-speaking boys. 
Sub-group d—Afrikaans-speaking girls. 


Sub-groups a and c were very small in numbers in three of the experimental 
groups and for the purposes of this article these sub-groups have largely been 
omitted from the discussion. 


The first experimental group consisted of the matriculation class of a high 
school, whereas the other three experimental groups attended commercial high 
schools in the Transvaal. The school-leaving classes were chosen as the members 
of the classes had known each other for a minimum of two years and some pupils 
had been together for most of their school life. This ensured that a stable and 
clearly-defined group structure had developed. Further, the leaders in the school 
community are found among the seniors and a hierarchy of leadership status is 
most evident among them. 


The schools were not randomly selected and the sub-groups were not well 
balanced in numbers, but as unilingual schools are the order of the day, this 
was unavoidable. 


In the experimental groups, three variables for division were operating: 
cultural differences, sex differences and the division of the groups into classes 
for instruction. To care for the last variable a control measure was used. In 
one experimental group a cultural group was divided into two classes for instruc- 
tion. The cleavage recorded in this case could be compared with that recorded 
when other variables were operating. 




IV.—THE ANALYSIS OF THE RESULTS. 


Sociomatrices for each experimental group were prepared, the choices on 
friendship and leadership criteria being recorded separately. Both the results of 
the sociometric testing and the social situation in each school were examined 
for evidence of two group structures. There was strong evidence of them, 
particularly in the social setting, but inter-correlations of the choices received 
on leadership and friendship criteria ranged from .73 to .77. The status of the 
stars was next examined to ensure that they were leaders in their own in-group, 
otherwise any conclusions based on their cross-cultural choosing would have 
been spurious. 

Sub-groups a and c, the English and Afrikaans-speaking boys, were included 
in the analysis, but as six of these sub-groups consisted of between two and five 
individuals, outgroup choices were forced upon them and a pattern of choosing 
was revealed different from that obtained with the more typical girls' groups. 
Friendship choices were directed to boys rather than girls, but these special 
circumstances and sex cleavage, which is known to exist in the choice of friends, 
must be borne in mind. For these reasons the results for the girls' groups only 
have been considered in the following discussions. 

Two types of analysis were undertaken: (a) the reciprocated and partially 
reciprocated choices were graphically depicted by means of sociograms and 
(b) statistical tests were applied to determine the level of significance of the 
volume of cross-cultural choices recorded. 


(a) The Sociograms. 

Each sociogram consisted of four concentric circles with the members 
identified by numbers and arranged within the circles according to their 
sociometric status and so as to show their interactions with their fellows most clearly. 
Bronfenbrenner’s (1945) fixed frame of reference was used, and apart from one 
case where the numbers of choices which were made fell so low that statistical 
limits also needed to be lowered, those members with sixteen choices and more 
appeared in the centre, those with more than ten in the next concentric circle, 
those with four and more in the third and those with less than four in the 
outermost ring. These limits range from the .02 to the .05 level of significance, 
depending on the numbers in the experimental groups. The girls were identified 
by a number placed in a small circle and the boys by a number placed in a 
triangle, and then the reciprocated choices (mutual pairs) were shown by a solid 
line and partially reciprocated choices by a dotted line. The English-speaking 
members of the test population were placed in the upper segment of the circle 
and the Afrikaans-speaking in the lower. All cross-cultural reciprocations were, 
therefore, easily seen by the lines crossing these segments. 
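
The placing of members within the four circles follows directly from the limits just stated, and may be sketched as below. The function name and the ring numbering (1 for the centre, 4 for the outermost ring) are conventions assumed for the example; the lowered limits used in the one exceptional case are not reproduced.

```python
def sociogram_ring(choices_received):
    """Return the ring (1 = centre, 4 = outermost) for a given number of choices received."""
    if choices_received >= 16:   # sixteen choices and more: the centre
        return 1
    if choices_received > 10:    # more than ten: the next circle
        return 2
    if choices_received >= 4:    # four and more: the third circle
        return 3
    return 4                     # less than four: the outermost ring


for score in (18, 12, 7, 2):
    print(score, sociogram_ring(score))
```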

The sociograms drawn on the basis of friendship criteria gave a clear 
diagrammatic picture of psychegroup structure (see Fig. 1 for an example. 
Three isolate boys are not shown). Those in the centre were highly chosen and 
the complex network of solid and dotted lines connecting them with each other 
and with those less chosen show that they made the most satisfactory interac- 
tions. The stars had as many as six of their ten choices reciprocated either on 
both criteria or on one, and to have five choices reciprocated wholly or partially 
was common. Those placed nearer the second circle were connected with the 
stars and members of like sociometric status in triangular patterns. However, 
the number of reciprocations declined and those in and near the outermost circle 
had fewer reciprocations still. Those who were significantly underchosen 
obviously yearned for interaction with the more important members of the group 
and not with each other. The lines expressing mutuality of choice sent out arms 
from the centre embracing fewer and fewer members as the outer sections of the 
sociogram were reached. In the outermost circle there was a chain pattern with- 
out the complexities of the triangle and clique patterns so prevalent in the 
centre. There was no successful cross-cultural interaction as expressed by 
reciprocal choices for Experimental Group I, although eighteen cross-cultural 
choices were recorded. In the other experimental groups there were ten 
occasions when cross-cultural interaction was recorded, and these interactions 
involved fourteen members. Of these, six were stars on friendship criteria, none 
was significantly underchosen, and on only two occasions was a star not a party 
to the reciprocation. They maintained their status on leadership criteria. 





FIGURE 1 


AN EXAMPLE OF A SOCIOGRAM DRAWN ON FRIENDSHIP CRITERIA. 


Upper segment—English-speaking group. 
Lower segment—Afrikaans-speaking group. 
Solid lines indicate complete reciprocation. 
Dotted lines indicate partial reciprocation. 


On leadership criteria the difference between members of high and low 
sociometric status was very marked (see Fig. 2 for an example). The stars 
interacted, but those outside the innermost circle were desultorily included, 
and large numbers were in complete isolation in the outermost circle. This is 
explained by the nature of the criteria and the tendency for the choice situation 
to be one-way. However, in choosing themselves the stars did not recognise 
cultural barriers. In all four experimental groups, there was cross-cultural 
interaction among the stars and a rejection of the remainder of the population. 


FIGURE 2 


AN EXAMPLE OF A SOCIOGRAM DRAWN ON LEADERSHIP CRITERIA. 


Upper segment—English-speaking group. 
Lower segment—Afrikaans-speaking group. 
Solid lines indicate complete reciprocation. 
Dotted lines indicate partial reciprocation. 

Of the total of twenty individuals involved in making cross-cultural 
choices, sixteen were stars and the remaining four had a choice status little 
short of that of a star. A comparison was made of individuals who had achieved 
star status and interacted cross-culturally on leadership criteria with those in a 
similar position on friendship criteria. Among the twenty in this category on 
leadership criteria were included all six who were in this category on friendship 
criteria and eleven who achieved star status on friendship criteria. 

The control group showed a cleavage between classes but also between two 
cliques on friendship criteria, and the cliques were not based on class division. 
The leaders of each clique were in one class and drew followers from both. This 
clique pattern and class cleavage disappeared with leadership criteria, which 
indicates that the leaders bridge cleavage whatever the variable producing 
cleavage may be. 

From the evidence of the sociograms these conclusions would seem to emerge: 


(1) Cross-cultural interaction is greater in the sociogroup structure than in the 
psychegroup structure of the groups. 


(2) Cross-cultural interaction is initiated by the leaders. This is particularly 
so in the sociogroup structure. 
(3) The fact that cross-cultural interaction in the psychegroup and sociogroup 
structures was largely confined to the leaders would suggest that contact 
in the latter group structure leads to contact in the former structure. This 
would be a reasonable conclusion, although conclusive proof of it cannot 
be given. 


(b) A Statistical Analysis. 

The volume of choice directed from one cultural group to another indicates 
a willingness to interact, and as the sociograms depended on mutual choices 
which were not encouraged by the nature of the leadership criteria, other 
methods of interpreting the results were sought. Criswell’s double-ratio method 
(1943) was used to measure in-group preference, and sub-group b's relation to d, 
and vice versa, produced consistently lower indices for leadership criteria. This 
index indicates the drawing power of the in-group and the bigger the index the 
greater the degree of self-preference. Therefore, the self-preference indices 
confirmed that in the sociogroup structure there was not as much preference for 
the in-group as there was in the psychegroup. 
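
The article does not set out the double-ratio computation. The sketch below assumes the usual formulation, in which the observed ratio of in-group to out-group choices is divided by the ratio expected by chance from the sizes of the two groups (a chooser having one fewer in-group member than the group size available to choose). The function name and all figures are hypothetical.

```python
def self_preference(in_choices, out_choices, in_size, out_size):
    """Double ratio: (observed in/out choice ratio) / (chance-expected in/out ratio).

    Assumed formulation: by chance, a member of a group of `in_size` can choose
    `in_size - 1` in-group members and `out_size` out-group members.
    Values above 1 indicate self-preference.
    """
    if out_choices == 0:
        return float("inf")  # no out-group choices at all: unbounded self-preference
    observed = in_choices / out_choices
    expected = (in_size - 1) / out_size
    return observed / expected


# Hypothetical figures: a sub-group of 11 pupils facing an out-group of 10.
friendship_index = self_preference(in_choices=60, out_choices=20, in_size=11, out_size=10)
leadership_index = self_preference(in_choices=45, out_choices=30, in_size=11, out_size=10)
print(friendship_index, leadership_index)
```

With these invented figures the index is lower on the leadership criteria, which is the direction of difference reported above.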

However, a statistical design was wanted which would test the chance 
occurrences of differences in the choosing pattern, and overcome the difficulties 
commonly met with in sociometric results of skewness of the distribution and 
inflation of the significance if number of choices given or received is made the 
basis for analysis. A test which ignored the fact that each individual was 
permitted several choices on each criterion and which grouped choices together 
as if each choice came from a different individual would give spuriously inflated 
levels of significance. The individual making the choices had to be taken as the 
basic unit. 


Question 1. 

The first question which it was desired to answer was whether there was 
more out-group choosing on leadership criteria for each cultural group. The 
hypothesis was that cross-cultural interaction was greater in the sociogroup 
structure than in the psychegroup structure of the groups. 

For each of the psychegroup and sociogroup structures an index of the 
individual’s in-group and out-group choosing was found. This preference index 
was found for each of the two group structures using the formula 

$$IP = \frac{I - O}{I + O}$$

where I and O represented the choices the individual made to the in-group and 
the out-group, respectively. Two scores for each individual were thus obtained, 
one score $IP_F$ derived from the choices made on friendship criteria and the other 
score $IP_L$ derived from choices made on leadership criteria. It will be seen that 
both $IP_F$ and $IP_L$ scores can range from +1 (the in-group wholly preferred) 
to −1 (the out-group wholly preferred). The difference, d, was then obtained 
for each individual, thus : 


$$d = IP_F - IP_L$$
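
A worked instance of the index may help. In the sketch below the function names and the choice counts are invented for illustration; the ten choices per structure follow from five choices on each of two criteria.

```python
def preference_index(in_group, out_group):
    """IP = (I - O) / (I + O); +1 means in-group wholly preferred, -1 out-group wholly preferred."""
    total = in_group + out_group
    if total == 0:
        raise ValueError("the individual made no choices")
    return (in_group - out_group) / total


def difference(friend_in, friend_out, leader_in, leader_out):
    """d = IP_F - IP_L for one individual."""
    return preference_index(friend_in, friend_out) - preference_index(leader_in, leader_out)


# Hypothetical pupil: all ten friendship choices in-group; six of ten leadership choices in-group.
ip_f = preference_index(10, 0)
ip_l = preference_index(6, 4)
d = difference(10, 0, 6, 4)
print(ip_f, ip_l, d)
```

A positive d, as here, indicates a stronger in-group preference on friendship than on leadership criteria.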


The members of each experimental group were then placed in one of three 
categories according to their d-values being greater than, equal to, or less than 
zero. The results are shown in Table 1. It will be seen that for all groups there 
are more individuals in Category 1 with d greater than zero than in Category 3 
with d less than zero. This would suggest that the preference for the in-group is 
greater for the friendship than for the leadership criteria—which is the 
hypothesis under examination. A test of the significance of the differences between 
the two sets of frequencies is provided by the Binomial Expansion, from which 
one can derive the probabilities of obtaining frequency differences as great as 
those between Category 1 and Category 3 in Table 1. The probabilities appear 
in the last column of the Table. 
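
The test just described amounts to what is now usually called a sign test. The sketch below is an illustration under stated assumptions: the d-values are invented, ties (Category 2) are set aside, and a one-tailed probability in the hypothesised direction is computed, since the article does not say which tail convention was adopted.

```python
from math import comb


def categorise(d_values):
    """Count individuals with d > 0, d == 0 and d < 0 (Categories 1, 2 and 3)."""
    positive = sum(1 for d in d_values if d > 0)
    zero = sum(1 for d in d_values if d == 0)
    negative = sum(1 for d in d_values if d < 0)
    return positive, zero, negative


def sign_test_p(positive, negative):
    """One-tailed probability of at least `positive` successes in n = positive + negative
    trials when d > 0 and d < 0 are equally likely (binomial expansion with p = 1/2)."""
    n = positive + negative
    return sum(comb(n, k) for k in range(positive, n + 1)) / 2 ** n


# Hypothetical d-values for ten pupils.
d_values = [0.8, 0.4, 0.2, 0.0, 0.6, -0.2, 0.4, 0.2, 0.0, 0.4]
cat1, cat2, cat3 = categorise(d_values)
p_value = sign_test_p(cat1, cat3)
print(cat1, cat2, cat3, p_value)
```

Because each pupil contributes a single sign, the individual and not the separate choice is the unit of analysis, which is the property the design was chosen to secure.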


TABLE 1 
DISTRIBUTION OF MEMBERS OF GROUPS ACCORDING TO THE DIFFERENCE, d (BETWEEN THE 
PREFERENCE INDICES ON THE FRIENDSHIP AND LEADERSHIP CRITERIA) BEING GREATER 
THAN, EQUAL TO OR LESS THAN ZERO. 


Experimental Group | Sub-group | Category 1 (d>0) | Category 2 (d=0) | Category 3 (d<0) | Significance of difference of frequencies in Cat. 1 and Cat. 3, expressed as a P-value 





TABLE 2 
DISTRIBUTION OF STARS AND NON-STARS ACCORDING TO NUMBER OF OUT-GROUP CHOICES 
MADE. 


Out-group choices made (0—1) | Level of Significance 

Stars .... 
Non-stars 17 

Stars .... 2 
Non-stars 

Stars .... 
Non-stars 

Stars .... 
Non-stars 

* Not significant. 




For experimental groups III and IV high levels of significance were obtained 
while two further sub-groups, one from each of the other two experimental 
groups, provided significant P-values. It can, therefore, be safely concluded that 
there is more cross-cultural interaction in the sociogroup than in the psyche- 
group structure of the experimental groups. 


Question 2. 

The second question which was asked was whether the stars gave out-group 
choices more readily than the non-stars on the leadership criteria. Leadership 
choosing only was examined for this purpose and the distribution of out-group 
choosing for the stars was tested against that for the non-stars. A 2×2 
contingency table was formed for each experimental group by grouping the 
stars into two categories—those who had made one or no choices from the 
out-group in the one category and those who had made two or more choices among 
the out-group in the other. The non-stars were arranged in the same categories 
and the significance of the difference between the two distributions was tested 
by a Chi-squared test. In calculating the values of chi-squared for the different 
2×2 tables, Yates' correction was applied when any of the expected values 
fell below five. It was not necessary to apply the exact method for 2×2 tables 
since the chi-squared values in all cases were either highly significant or not at 
all so. The results are shown in Table 2 where it can be seen that the non-stars 
made out-group choices as readily as the stars. 
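
The procedure can be illustrated with a short sketch. The cell frequencies below are invented and are not taken from Table 2; the function name is likewise an assumption. The rule for applying Yates' correction is the one stated above, and 3.841 is the familiar 5 per cent. point of chi-squared for one degree of freedom.

```python
def chi_squared_2x2(a, b, c, d):
    """Chi-squared for the 2x2 table [[a, b], [c, d]].

    Yates' continuity correction is applied only when an expected value falls below five.
    Returns (chi_squared, yates_applied).
    """
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    observed = [a, b, c, d]
    expected = [row_totals[i] * col_totals[j] / n for i in range(2) for j in range(2)]
    use_yates = min(expected) < 5
    chi2 = 0.0
    for o, e in zip(observed, expected):
        deviation = abs(o - e)
        if use_yates:
            deviation = max(deviation - 0.5, 0.0)
        chi2 += deviation ** 2 / e
    return chi2, use_yates


# Hypothetical table: rows are stars and non-stars; columns are 0-1 and 2+ out-group choices.
chi2, yates = chi_squared_2x2(3, 7, 20, 10)
significant = chi2 > 3.841  # 5 per cent. point, one degree of freedom
print(chi2, yates, significant)
```

With these invented frequencies the corrected value falls short of the 5 per cent. point, so the two distributions would not be judged to differ.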


Question 3. 

The third question which was asked was whether the stars received out- 
group choices more readily than the non-stars. By definition, the stars receive 
significantly more choices than non-stars, but in this test out-group choices only 
were examined to compare the acceptability to the in-group of out-group stars 
and out-group non-stars. A 2×2 contingency table was formed. The same 
procedure was followed as for question 2, but the choices which stars and non- 
stars received formed the basis of the distributions. In Table 3 it can be seen 
that the distributions thus formed had a 1 per cent. level of significance for three 
of the experimental groups. This means that the stars not only achieved their 
status in the in-group, but also were recognised as stars by the out-group. 


TABLE 3 
DISTRIBUTION OF STARS AND NON-STARS ACCORDING TO NUMBER OF OUT-GROUP CHOICES 
RECEIVED. 


Out-group choices received (0—1 | 2+) | Level of Significance 

Stars .... 
Non-stars 

Stars .... 
Non-stars 

Stars .... 
Non-stars 

Stars .... 
Non-stars 
The answer to questions 2 and 3 is that out-group stars attract choices from 
both stars and non-stars in the in-group. Stated another way, there is no 
difference between stars and non-stars in the giving of choices to out-group 
members, but these choices are given only to the stars in the out-group. 


V.—CONCLUSIONS. 


Both the sociograms and the statistical tests showed that the first hypo- 
thesis can be accepted with one reservation. Leaders were chosen cross- 
culturally, but the leader of the in-group was preferred. There was not a 
complete absence of cleavage in the sociogroup structure. That the leaders 
bridge the gap between the culture groups as stated in the second hypothesis can 
be accepted for the sociogroup structure only. Further, the evidence of the 
sociograms suggests that there is the strong possibility that interaction in the 
sociogroup structure leads to interaction in the psychegroup structure, although 
no adequate proof can be given of this. 


VI.—THE RELEVANCE OF THE RESULTS IN THE SCHOOLS. 


These results indicate that given the opportunity of associating in parallel- 
medium schools the pupils do co-operate in the environmental organisation of 
the school. They need to play games together, not as members of opposite teams; 
they need to work together as prefects, or as members of school clubs and 
societies ; and they need to take part in the same plays, speech competitions, 
parades and so on. Particularly, the leaders require to be brought into contact 
as they bridge the cleavage. In South Africa there are several parallel-medium 
schools with a proud tradition and those who know them are convinced that 
differences can be a source of strength rather than weakness, as out of diversity 
comes growth. Furthermore, after school days, the two cultural groups work 
together, frequently inter-marry and show goodwill in many ways. 

On the other hand, the single medium school has become the declared 
policy of the political party ruling at present and separation is being perpetuated 
even in military training after school. Without the lead given by the top 
administrators in education, and without the extra effort needed in organisation, 
administration, and even in tact, to make parallel-medium schools a success, 
they are doomed to failure. As in the school populations, in the national life 
it is the leaders who must initiate contact between the groups, and without 
it separation and isolation will be perpetuated. 


THE LEARNING EFFECTS OF PICTURES* 


By OLOF MAGNE AND LENNART PARKNAS 
(University of Gothenburg, Sweden) 


Summary. The present study concerns the problem of the learning effects of 
projected pictures when used as instructional aids and the validity of measure- 
ment techniques in media research. 

Two hypothetical functions of pictures are discussed: the ‘information 
value’ and the ‘motivation value’ of the instructional material. 

In order to study the information value of pictures, three experiments 
were designed to test the following hypotheses : 

The method of testing defines which part of the retention (verbal or 
pictorial) we intend to measure, and 

A method of testing, adequate to the mode of presenting the information, 
covers a larger part of the total retention than a method that is inadequate to 
this mode. 

Different materials were constructed in order to give the subjects verbal 
or pictorial information and to measure retention. The results supported the 
hypotheses. At the same time the experiments suggested that, under the 
circumstances studied by the authors, pictorial learning was superior to verbal 
learning. 


I.—INTRODUCTION. 


A LARGE number of authors have demonstrated that learning effects are in- 
creased if visual aids are used. 

A survey of the literature dealing with this field of research will be found, 
for example, in the works of Hoban, Dale and Finn (1949), Hoban and van 
Ormer (1951) and May and Lumsdaine (1958). Moreover, it may be mentioned 
that the effect of instruction material is discussed in teachers' guides, such as 
those by Dale (1954), Wittich and Schuller (1957), the two year books concerning 
audio-visual methods (1949, 1954), published by the National Society for the 
Study of Education, and a very large number of works on teaching individual 
subjects such as geography, biology, mathematics, and physics. 

The authors of the present investigation have interested themselves in the 
effectiveness of projected pictures when used as instructional aids. There have 
been comparatively few studies on this subject. In general, the findings show 
that pictures are passably effective, provided the teacher instructs the pupils 
how to study the content of the pictures (Buswell 1935). Other investigations, 
however, have led to exactly the opposite conclusion. That pictures produce 
only insignificant learning effects, at least in some cases, is shown, for example, 
by "andere (1950), Stampolis (1953), Duregard (1953), Vernon (1951, 1953, 
1954), and Klingberg (1957). Terkedal (1955) has pointed out the importance of 
testing the children’s knowledge by an adequate method in relation to the 
information given. 

In Vernon’s first experiment (1951), two texts, each comprising about 400 
words, were used ; one of these was not illustrated, and the other had graphic 
diagrams. Vernon suggests that the graphic illustrations obstructed the 
retention of the text and that this result indicates that the subjects (girls aged 
16-18) lacked the ability to profit by the knowledge the diagrams were intended 
to convey. 


* Translation into English by W. H. Hilton-Brown, Stockholm. 

In Vernon’s second experiment (1953) four short articles on two different 
topics were used ; and in each case the material was presented in two versions. 
One version was written in a popular style and was illustrated by pictures 
that were expected to arouse interest. The other version was more concise and 
factual, and contained graphic illustrations and tables. The subjects (boys and 
girls, aged 16-18) were required to recall the contents orally and to answer 
certain questions. Vernon was unable to observe any appreciable difference in 
retention between the two articles. 

A third study by Vernon (1954) was carried out with subjects who were 
12 years of age. Also in this case, the illustrations appeared to exert only an 
insignificant influence. 

Klingberg (1957) has conducted an investigation in accordance with 
Vernon's studies. Klingberg interprets the results as follows: (1) retention of 
texts with pictures is about the same as retention of unillustrated texts; and 
(2) pictures hinder children from reading a text with their full attention. 


II.—MEASUREMENT OF LEARNING EFFECTS OF ILLUSTRATIONS. 


These conflicting results raise the question whether it is clearly understood 
what is being measured. What should be the criteria for the choice of illustra- 
tions? The question can be extended to cover all instructional material. If we 
disregard the social aspect of education and concentrate on the acquisition of 
facts, we must judge entirely by the effectiveness of the instruction as defined by 
the degree of retention a certain time after learning. The degree of clearness has 
been employed as a rule of thumb for the appraisal of effectiveness. This outlook 
has been systematized in what is known as the cone of experience, a diagram 
which we reproduce from Dale (1954) (Fig. 1). 

Some authors have maintained that the cone form indicates a reduction in 
learning effect with increasing abstraction. 

Although this approach is based on teaching experience, we are convinced 
that learning effects are not dependent merely on the degree of clearness. We 
may imagine a verbal representation such as a newspaper article and a first- 
hand experience in the form of an interview, both having the same content, 
where the first yields greater retention than the second. Consequently, there 
seem to be other functions (or ‘dimensions’) of the instructional material 
which influence effectiveness. We consider that the degree of information about 
the object is one of the functions of the instructional material ; and we refer to 
this function as the ‘information value.’ 

If we return to the example of the newspaper article and the interview, it 
can be asserted that their information value is the same. Should we find different 
degrees of retention, this must be due to another function or ‘dimension.’ Thus, 
the pupils who do the interviewing may, for some reason, have attained a positive 
or negative attitude to the material. There seems to be a qualitative (as opposed 
to a quantitative) or an illustrative (in contrast to an informative) aspect of the 
learning material. With regard to this group of factors, we speak of the 
‘motivation value’ of the instructional material. 

With this conception as a starting point, in experiments on the learning 
effects of illustrated and unillustrated texts, we ought to avoid using pictures 
with a high motivation but a low information value and then drawing general 
conclusions concerning learning with the aid of pictures in contrast to learning 
without pictures. 

Furthermore, with regard to the problem of measurement it hardly appears 
defensible to measure the retention of pictures only by verbal tests. 


FIGURE 1 
THE “CONE OF EXPERIENCE.” 


We presuppose there is yet another factor that has to be taken into account, 
viz., the aim of learning. If we wish the pupils to give an adequate account of 
retained visual concepts, visual types of learning material should be selected ; 
but, if we wish an account in the form of a first-hand experience, the mode of 
learning should be a first-hand experience ; and finally, if the pupils have to 
acquire verbal knowledge, the material to be studied should preferably be 
presented in a verbal form. 


III.—HYPOTHESES. 
Thus, in order to study more closely this complex of problems, we have 
designed three experiments to test the following hypotheses : 
(1) The method of testing defines which part of the retention we intend to 
measure. 
(2) A method of testing, adequate to the mode of presenting the information, 
covers a larger part of the total retention than a method that is inadequate 
to this mode. 

If these hypotheses are confirmed, a comparison may be made between 
verbal and pictorial learning or between different combinations of these two 
modes. Comparisons of this kind will give a more detailed conception of the 
relative effectiveness of the various learning methods. 


Experiment 1. 

In Experiment 1, the authors aimed at making comparisons between learning 
with words and with pictures. Though these comparisons were to be as 
direct as possible, they were nevertheless to resemble actual teaching situations. 

The learning material (a set of colour slides) was projected on to a screen 
in the classroom ; the subjects had to study each slide for 3 minutes. Each 
slide represented four plants, and of every combination of four plants, there 
were two versions: one consisting of a coloured picture of the four plants 
(see Fig. 2) and the other of a verbal description. The test consisted in judging 
whether, in a second slide, the petals, leaves or roots of the plants had been 
changed. There were two versions also of the test picture: a pictorial 
(see Fig. 3) and a verbal representation. 

Thus, information could be furnished verbally or pictorially, and retention 
could be measured either by verbal or by pictorial means, i.e., there were four 
experimental conditions scheduled in Table 1. 


TABLE 1 

EXPERIMENTAL CONDITIONS. 

                     Type of Information 
Type of Test         Pictorial    Verbal 
Pictorial            PP           WP 
Verbal               PW           WW 





The subjects were 228 boys and girls aged 13. Each class was divided into 
two halves so that the average marks for science should be the same. 

Each group was shown a slide with information, and immediately afterwards, 
the corresponding test slide. The slides for the verbal versions were 
designated W1, W2, W3 and W4, and for the pictorial versions P1, P2, P3 and 
P4. One half of a class was given the following information series: P1, W2, P3, 
W4, and the other half was given : W1, P4, W3, P2. This applied to all the classes. 
The test slides were varied in different combinations. For each experimental 
condition (PP, WW, PW, WP) there were four tasks, which will be referred to 
as items. Item 1 was composed, for instance, of P1W1, P1P1, W1P1, and W1W1. 

Retention was determined by assigning plus scores for correct answers. 

The results were subjected to a statistical analysis comprising variance 
analysis, t-testing and χ²-testing of each item. Table 2 gives a summary of the 
p-values for the different significance tests. 

Our results suggest that there are differences between the various experimental 
situations which cannot be due to chance. 

Moreover, in three out of our four items we can show that pictorial learning 
is superior to verbal learning, when retention is measured by both pictorial 
tests and verbal tests. In item III, no significant difference was observed between 
the modes of presentation. If retention is measured only by pictorial tests, 


TABLE 2 
P-VALUES FOR THE DIFFERENT SIGNIFICANCE TESTS. 

(The experimental situation which afforded the highest mean value is mentioned first, 
unless otherwise stated). 

[Table body not recoverable. Columns: number of item. Rows: variance analyses, 
between experimental situations; mode of presentation (P × W).] 

* PW had the highest mean value. 
††† Indicates significance at the 0.001 level. 
†† Indicates significance at the 0.01 level. 
† Indicates significance at the 0.05 level. 


pictorial learning is found to be superior to verbal learning. In this experiment 
we obtained differences for all our four items ; these differences indicated the 
superiority of pictures. There is only a 0.001 probability of this result being due 
to chance. If, on the other hand, retention is measured only by verbal tests, 
we found that verbal learning was superior, although this was not very pronounced. 

Furthermore, we can report tendencies and, in two cases, significant 
differences which indicate that the picture tests cover a greater part of the 
total retention. 

Moreover, for all items, it was found that a higher degree of retention was 
recorded when tests adequate to the mode of presentation (i.e., PP or WW) 
were used than when the tests differed from the information type (i.e., PW or 
WP). 

In order to ascertain what the strongly significant interaction, which occurred 
in all items, may indicate, we shall study item II in a four-fold table, where the 
frequencies are the sums of the plus scores. 


TABLE 3 
SCORES ON ITEM II. 

                        Information 
Test            Pictorial    Verbal    Difference 
Pictorial          206         144         62 
Verbal             142         170        -28 
Difference          64         -26 


From Table 3, it is apparent that the difference between the retention of 
pictorial information, measured by pictorial tests and by verbal tests (64) is 
greater than the difference between the retention of verbal information measured 
by pictorial and verbal tests respectively (-26). Had there been no interaction, 
both these differences would have been about the same. Similar results were 
obtained with regard to the difference between the retention of pictorial 
information measured by pictorial tests and verbal information measured by 
pictorial tests (62); and the difference between the retention of pictorial 
information measured by verbal tests and verbal information measured by 
verbal tests (-28). The direction which the differences tend to take is also 
apparent. It is interesting to note that PP (frequency 206) is superior to WP 
(frequency 144) to approximately the same extent as it is superior to PW 
(frequency 142) ; and that WW (frequency 170) is similarly superior to WP to 
about the same extent as it is to PW. From a practical point of view, this 
means that we do not obtain any difference between the retention of information 
given pictorially and measured verbally and the retention of the same informa- 
tion given verbally and measured pictorially. This was apparent also from the 
t-testing, where we found that PW tended to predominate in two cases, and WP 
in two other cases ; but there were no significant differences. 
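
The interaction argument above can be retraced with simple arithmetic. The following Python sketch takes the four frequencies quoted in the text for item II (PP 206, WP 144, PW 142, WW 170) and computes the four simple differences together with a difference of differences. The function and key names are illustrative assumptions and are not part of the original analysis.

```python
# Plus-score frequencies for item II as quoted in the text.
# First letter = mode of information, second letter = mode of test.
scores = {"PP": 206, "PW": 142, "WP": 144, "WW": 170}


def simple_differences(s):
    """Return the four simple differences discussed in the text."""
    return {
        # Pictorial test minus verbal test, within each information type.
        "test_effect_pictorial_info": s["PP"] - s["PW"],
        "test_effect_verbal_info": s["WP"] - s["WW"],
        # Pictorial information minus verbal information, within each test type.
        "info_effect_pictorial_test": s["PP"] - s["WP"],
        "info_effect_verbal_test": s["PW"] - s["WW"],
    }


def interaction_contrast(s):
    """Difference of differences; a value near zero would mean no interaction."""
    return (s["PP"] - s["PW"]) - (s["WP"] - s["WW"])


diffs = simple_differences(scores)
contrast = interaction_contrast(scores)
print(diffs)
print("interaction contrast:", contrast)
```

With no interaction the two test effects would be roughly equal and the contrast would be close to zero; here the two test effects have opposite signs, which is what the text means by a strongly significant interaction.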


The argument indicated in the preceding section applies also to items I and III, 
and, to a great extent, to item IV. 


In short, our results may be said to support the hypothesis that the method 
of testing influences the definition of retention. If it is intended to compare 
pictorial and verbal knowledge or to test the retention of pictorial and verbal 
information, then retention should be measured by both pictorial and verbal 
tests. 


At the same time, the experiment verified that, in the case studied by us, 
pictorial learning was superior to verbal learning. 


Experiment 2. 

The learning material in Experiment 1 was used in an experiment with a 
somewhat different design, where the subjects took part in a diploma course in 
education at the University of Göteborg. 


The design of this experiment was as follows: The participants were 
divided into four groups. In Experiment 2, each group formed two teams by 
lot, in such a way that both instruction (pictorial and verbal instruction, 
respectively) and tests (pictorial tests or verbal tests) were varied in the groups 
as well as in the teams. The time of exposure for the instruction pictures was 
2 minutes, whereas for the test pictures there was no time limit. 


Table 4 summarizes the comparisons between the four experimental 
conditions in Experiment 2. 
The sequence of the experimental conditions, in accordance with the size of the 
mean, was the same in Experiments 1 and 2, with the exception of verbal 
information tested by pictorial tests (WP) and pictorial information tested by 
verbal tests (PW) ; but in both cases the difference between these tests was 
insignificant. 

Furthermore, we found that in both experiments, the mean retention of the 
pictorial information was greater than that of the verbal information. 

Of greatest interest, however, is the result that, 
even among adults, the means for PP and WW are higher than those for PW 
and WP. If the retention of the pictorial and the verbal information had been 


TABLE 4 
MEANS AND t-TESTING OF DIFFERENCES BETWEEN MEANS IN EXPERIMENT 2. 

                                         Desig-   No. of                  t for differences 
Experimental Conditions                  nation   subjects   Mean         WW       WP       PW 

Pictorial information, pictorial tests   PP       20         14.50±0.45   2.39†    4.24††   4.60†† 
Verbal information, verbal tests         WW       20         12.95±0.47   -        1.58     2.16† 
Verbal information, pictorial tests      WP       21         12.00±0.38   1.58     -        0.76 
Pictorial information, verbal tests      PW       20         11.55±0.45   2.16†    0.76     - 

† Significance at 0.05 level. 
†† Significance at 0.001 level. 





tested only by the pictorial test, a significant difference would have been 
obtained which could have implied that the retention of the pictorial information 
was greater than that of the verbal information. A similar investigation, where 
only the verbal test was applied, would have shown that verbal information was 
more effective than pictorial information. Thus, quite different conclusions can 
be drawn, depending on the test method employed. 
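
The t values reported in Table 4 can be approximately reproduced from the printed means. The Python sketch below assumes that the figure following each mean is its standard error and that each t is the difference between two means divided by the square root of the sum of the squared standard errors; the source states neither point, and the assignment of the third and fourth rows to WP and PW is inferred from the row labels.

```python
import math

# Means and accompanying figures from Table 4. The second figure is read
# here as the standard error of the mean (an assumption).
conditions = {
    "PP": (14.50, 0.45),
    "WW": (12.95, 0.47),
    "WP": (12.00, 0.38),
    "PW": (11.55, 0.45),
}


def t_for_difference(a, b):
    """t = (mean_a - mean_b) / sqrt(se_a^2 + se_b^2)."""
    mean_a, se_a = conditions[a]
    mean_b, se_b = conditions[b]
    return (mean_a - mean_b) / math.sqrt(se_a ** 2 + se_b ** 2)


pairs = [("PP", "WW"), ("PP", "WP"), ("PP", "PW"),
         ("WW", "WP"), ("WW", "PW"), ("WP", "PW")]
t_values = {pair: round(t_for_difference(*pair), 2) for pair in pairs}
for pair, t in t_values.items():
    print(pair, t)
```

Under these assumptions the computed values fall within a few hundredths of the printed ones, which suggests, without proving, that this is how the tabled values were obtained.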


Let us now imagine a practical learning situation where the information is 
presented both verbally and pictorially. The following two cases will then be of 
interest : 


(1) The retention of this combination of pictorial and verbal information is 
tested by pictorial tests. On the basis of Experiments 1 and 2, we may expect 
retention to be at least as high as that of pictorial information alone. 


(2) The retention of the combination of pictorial and verbal information is 
tested by verbal tests. In this case, retention should be greater than when 
pictorial information is tested by verbal tests, and, in accordance with the 
results of Vernon’s and Klingberg’s investigations, the degree of retention should 
be about the same as that of verbal information alone. The combined retention 
measured by verbal and pictorial tests should then be found to be higher than 
the retention of verbal information. 


In order to test these hypotheses a third experiment was conducted. 


Experiment 3. 


The information consisted of a lesson on Greenland given both with and 
without a filmstrip. The teachers who took part in the experiment made a 
joint plan for the lesson on the basis of the script for the filmstrip and the test. 

The only difference between the two types of lessons was the showing of the 
filmstrip. Each lesson took 20 minutes ; testing occurred immediately afterwards ; 
no time limit was imposed. 

In all, 192 boys and girls, aged 11-12, took part in the experiment. As 
instructors, three teachers were employed who had thorough experience of the 
age-group in question. In order to eliminate, as far as possible, any teacher 
effects, the teachers replaced each other in the different groups. 

The test form contained five verbal questions (Nos. 1-5) and five pictorial 
questions (Nos. 6-10). 

The results of the testing with the verbal items 1-5 are summarized in 
Table 5. 


TABLE 5 

SCORES ON VERBAL ITEMS. 

Experimental group               Score    Significance 
Classes without a filmstrip       460     χ² = 0.11 
Classes with a filmstrip          450     0.70 < P < 0.80 





Thus, no statistically significant difference was observed between the groups 
taught with a filmstrip and those taught without one, when retention was 
measured by a verbal test. Our results agree well with Vernon’s and Klingberg’s 
observations, and also with the hypothesis formulated by us. 
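
The χ² value for the verbal items can be retraced from the two totals. The Python sketch below assumes that the test compares the two total scores (460 and 450) against equal expected totals with one degree of freedom; the source does not describe the computation, so this is only one plausible reading.

```python
import math

# Total scores on the verbal items, as given in Table 5.
observed = {"without_filmstrip": 460, "with_filmstrip": 450}


def chi_square_equal_expected(counts):
    """Chi-square against the hypothesis that all totals are equal."""
    expected = sum(counts) / len(counts)
    return sum((c - expected) ** 2 / expected for c in counts)


def p_value_one_df(chi2):
    """Upper-tail probability of chi-square with 1 degree of freedom."""
    return math.erfc(math.sqrt(chi2 / 2.0))


chi2 = chi_square_equal_expected(list(observed.values()))
p = p_value_one_df(chi2)
print(f"chi-square = {chi2:.2f}, P = {p:.2f}")
```

On this reading the statistic rounds to 0.11 and the probability lies between 0.70 and 0.80, in line with the reported figures.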

The corresponding results of testing with pictorial items 6-10 are shown in 
Table 6. 


TABLE 6 

SCORES ON PICTORIAL ITEMS. 

[Table body not recoverable. Rows by experimental group: classes without a 
filmstrip; classes with a filmstrip.] 





There was a significant difference in retention between the group taught 
with a filmstrip and that taught without one when retention was measured by a 
pictorial test. This is in agreement also with the hypothesis formulated by us ; 
on the other hand, it does not agree with the findings of previous investigations. 


Hence, in this experiment also, we were able to demonstrate that the 
estimated retention depends to some extent on the character of the tests. 
Consequently, it is necessary to be careful when drawing conclusions with 
regard to learning with and without pictures, if retention is measured only by 
verbal tests or only by pictorial tests. Naturally, the validity of our results is 
restricted to the experimental situation studied by us. We would point out, 
however, that this resembles very closely real classroom situations. It is, of 
course, impossible to draw any direct conclusions concerning the knowledge 
acquired by the reading of texts with and without illustrations, but there is much 
to indicate that this problem is similar to the problems studied by us. 




We would summarize this experiment as follows: When applying verbal 
tests, there were only small differences between the two groups of children. 
On the other hand, it appeared that retention, tested by pictorial items, was 
greater for the groups taught with a filmstrip than for those taught without one. 
Consequently, we are apparently justified in concluding that, in the situation 
we investigated, teaching with filmstrips yielded a higher degree of combined 
retention than did teaching without filmstrips. 


IV.—DISCUSSION. 


It has been said that the best and most effective form of teaching is to give 
the pupils direct first-hand experience. Naturally, this is not always possible. 
At times, this can be done in Physics, Chemistry, Biology, and Mathematics. 
In certain circumstances, it may also be arranged in vocational schools and in 
connection with practical work in workshops. But, if knowledge is to be acquired 
of foreign languages or foreign countries and cultures, this has to be com- 
municated by the aid of pictures and recorded speech. It is in order to communi- 
cate this type of knowledge that audio-visual aids are required. Though they 
represent an indirect form of knowledge, they are regarded as more effective 
than verbal symbols. 


In what is known as the cone of experience, which was previously referred 
to, reality is represented by the base, and the farther we proceed from this, the 
farther we are removed from reality. The cone represents an approach, where 
a direct and simple causal relation has been assumed to exist between proximity 
to reality and teaching results. Naturally, such a simplified representation of the 
effects of teaching methods lends itself readily to criticism. 


Instead of the cone of experience, we suggest a model where the information 
and motivation values of the medium are the decisive factors. If we suppose 
these values to be located on the axes of a co-ordinate system, then a number of 
experimental situations become self-evident. A series of such experiments would 
confirm the validity of this model. Then we would find, for specific teaching 
units, the combination of motivation and information values that yields maximum 
retention. Let us consider the following cases respecting the amount of 
retention : 


(1) Varying the motivation value while keeping the information value 
constant (teachers' ratings may be used for grading the motivation value). 


(2) Varying the information value, motivation value constant. 


(3) By simultaneously varying the information value and the motivation 
value, it would be possible to determine the conditions under which a maximum 
of retention is obtained. 


We leave it an open question whether the negative parts of the axes of the 
hypothetical model correspond to actual conditions and whether the axes of the 
model are to be assumed to form right or oblique angles, i.e., if information and 
motivation are correlated or not. 


If the acquisition of knowledge is regarded from this standpoint, it will be 
admitted that there are direct, ‘pictorial’ teaching-methods which are ineffective 
and verbal methods which are effective in respect of learning. 

There is, however, a further objection to the approach that has given rise 
to the cone of experience. We have been able to show that it is not defensible to 
measure the retention of various types of learning with tests representing only 
one type of material. The investigation indicates that pupils should be allowed 
to study pictures, if pictorial retention is required. On the other hand, if 
primarily verbal retention is desired, verbal material should be employed ; and 
furthermore, if first-hand experience is the objective, the pupils should be given 
first-hand experience. 


SUMMARY. Samples of 12-14 year-old British and American children were 
tested with Cattell's HSPQ, and their mean scores compared on fourteen 
personality factors and two second-order factors (extraversion and anxiety). 
Significant differences were found on a number of factors and, in particular, the 
British children were found to be less sociable, more assertive and less con- 
scientious. The pattern of prediction of school achievement was compared, and 
the same factors were in general found to be related to school achievement in 
both countries. 


I.—INTRODUCTION 


ALTHOUGH it is widely recognised that personality factors play a part in 
determining school attainment, methods of assessing them have in general 
proved disappointing. 

Middleton and Guthrie (1959), for example, say that “ attempts to improve 
prediction by using non-intellective factors such as interests and personality 
traits have yielded quite discouraging results . . . the principal difficulty is 
probably heterogeneity of the criterion and non-summative or non-linear 
predictions." 

It must be admitted that neither questionnaires nor objective tests of 
personality have yet reached the stage of acceptance as practical instruments 
of selection and guidance, while ratings are very often so bedevilled with halo 
effect as to be even less satisfactory. From the theoretical point of view, 
however, there are enough suggestive and interesting findings in the literature 
to make it clear that research is getting warm, so to speak, and that real effects 
are there to be discovered when the research instruments have been further 
standardised and refined. The High School Personality Questionnaire appears 
to be one of the more promising instruments available for use with children, 
and has the following advantages : 


(a) It provides measures of fourteen primary factors, and also of two 
main second-order factors into which the primary factors fall, i.e. anxiety and 
extraversion. 


(b) The factors measured have emerged from a considerable body of 
prolonged research. They are briefly described in the tables that follow. A 
full description is provided in the handbook of the test (Cattell, Beloff and Coan, 
1958). 


(c) The factors correspond in most cases to those that have been 
extensively studied in adults. 


(d) The test is designed to be used equally in the British and American 
cultures, and the items contain no glaring Americanisms or Anglicisms. 


The present study is concerned with the following questions. Are there 
systematic differences between British and American children at age 12-14 on 
some or all of the fourteen primary and two main second-order factors which 
the HSPQ is designed to measure? What degree of prediction of school attain- 
ment is gained by the use of this test of personality ? Do the same personality 
factors make for school success in England as in America ? 


II.—PREVIOUS RESEARCH 


(i) Cross-cultural studies of personality. 


Studies comparing British and American children in terms of personality 
factors appear to be scarce. Sarnoff, et al. (1958), compared anxiety among 
American and English children (American from Connecticut, English from 
Hendon). They were primarily interested in validating a measure of test 
anxiety, and their hypothesis that English primary school children would show 
higher test anxiety than their opposite numbers in the U.S.A. was confirmed. 
On a test of general anxiety, however, no significant national difference was 
found. 

Cattell and Warburton (1961) compared the scores of American University 
students, British graduate students (student teachers) and British students at 
a College of Advanced Technology on the 16 P.F. test. On ten out of fifteen 
primary personality factors (excluding intelligence), differences were found 
between the American and British University students. In addition there 
were considerable differences between the two British samples (sometimes 
larger than the British-American differences), and these were statistically 
significant in the case of seven out of fifteen factors, with smaller groups being 
compared than in the case of the British-American comparisons. The groups 
were also compared in terms of the second-order factors of anxiety and ex- 
traversion and the British University students were found to be less anxious 
and more introverted than the American. British C.A.T. students were least 
anxious and most extraverted of the three groups. 


(ii) The relation of personality factors to school attainment. 


Most studies have been concerned with the two broad factors of anxiety 
and extraversion—introversion. Findings have been rather conflicting, and 
the discrepancies can probably be assigned to some or all of the following 
causes : 

(a) Differences in national culture pattern, e.g. extraversion may be more 
of an asset in American primary schools than in British ; 

(b) Different effects at different ages, e.g. it seems plausible, as suggested 
by Furneaux (1957), that introversion may be more of an asset at sixth-form 
and university level owing to changed conditions of work ; 

(c) Different definitions of the variables, and in particular a blurring of 
the distinction between anxiety and neuroticism. (A comparison, for instance, 
of anxiety scores based on the HSPQ and neuroticism scores based on the 
Maudsley personality inventory for children would be useful in this connection.) 

(d) Non-linear relations between the variables, e.g. Cox's (1960) finding 
that a middling degree of anxiety favoured school work, and Grooms and 
Endler’s (1960) that only with a high anxiety group did a test of personality 
yield any improvement in the prediction of achievement ; 

(e) Varying effects against different criteria, as in Lynn's (1957) finding 
that anxiety had a different effect, according as the criterion was school work 
in English or in arithmetic ; 

(f) Sex differences. Women have been found to have higher average 
scores than men on introversion in a number of studies (e.g. Anderson, 1960). 
It is also possible that the relation between introversion and school achievement 
shows a sex difference. 


In view of these discrepancies, and of considerations of space, a systematic 
review of the literature will not be offered here. A useful survey of recent papers 
in this area is provided by Warburton (1962). 


III.—METHOD. 
(a) Samples used. 

The American sample consisted of the entire seventh grade in two junior 
high schools in Springfield, Illinois and Paxton, Illinois. They were selected as 
representing contrasting types of area, Springfield being the state capital, with 
the school in question drawing almost exclusively on urban families, and Paxton 
being a small country town, to which many of the children came from the 
surrounding countryside. The numbers of children tested were 154 and 123, 
respectively. There were approximately equal numbers of boys and girls in 
each sample and the mean age was 12-7. 


The first British sample consisted of all the first-year children in an eight- 
stream secondary modern school in South Manchester. The number of children 
tested was 230. There were 113 boys and 117 girls, and the mean age was 12:5. 


The second British sample consisted of 140 children in the third year of 
secondary schooling. The mean age of the children at the time of testing was 
14-3. The sample included equal numbers of boys and girls and was designed 
to be representative of the age-group in ability and of urban society in the 
socio-economic status of parents. 


The availability of two separate samples from different schools in both the 
U.S.A. and in England provides useful evidence concerning intra-country as 
well as inter-country differences. 


(b) Administration of the tests. 


The three authors were responsible for administering the tests to the 
American samples, to the British Secondary Modern sample, and to the 
representative British sample respectively. At Springfield, Illinois, it was 
possible to administer the HSPQ to the entire seventh grade at one sitting in 
the school hall. In the other three samples it was given by classes. Parallel 
forms of the test (A and B) were administered to both American samples and 
to the British secondary modern sample. In the case of the other, more 
representative, British sample it was unfortunately only possible to administer 
Form A. Similarly, measures of school achievement were available for the 
two American samples and the secondary modern sample, but not for the 
second British sample. 

The criteria of achievement available in the American schools were firstly 
the Stanford intermediate achievement test, Form K, which was taken by 
both schools, and which yields a total score, and six sub-scores, as follows : 
paragraph meaning, word meaning, spelling, language, arithmetic reasoning 
and arithmetic computation. Secondly, teacher ratings were obtained on the 
children’s behaviour in school on a graphic seven-point scale. 

The criteria of achievement available in the British secondary modern school 
sample were the results of the annual internal school examinations in English, 
Mathematics, History, Geography, Science and Art. The papers were set by 
the heads of departments in these subjects and all eight streams of children 
attempted the same papers. To aim at a standardization of marking, one 
teacher was responsible for the marking of a particular question throughout all 
the streams. This procedure was not adopted specifically for this research, 
but was part of the annual promotion policy of the school involved. 


IV.—RESULTS 


(a) Comparison of the two American groups. 

Table 1 below shows for the two American schools the mean scores and 
standard deviation on the fourteen primary and the two second-order factors 
measured by the HSPQ. 


TABLE 1 
MEAN SCORES (HSPQ, FORMS A AND B). 
[Table body not reproduced.] 

The urban group (Springfield) is significantly more introverted and more 
anxious than the rural one. The mean difference in anxiety is particularly 
marked, not only in terms of the second-order factor, but also in the fact that 
of the five primary factors on which the groups differ, four (C-, H-, O, Q4) 
load on anxiety, and in each case the difference indicates that the urban children 
score in the anxious direction. 

(b) A comparison between American children and British secondary modern 
children. 

Table 2 shows a comparison between the total American sample and the 
British secondary modern school sample. 


TABLE 2 
MEAN SCORES (HSPQ, FORMS A AND B). 
[Table body not reproduced.] 

The marked difference in intelligence (B) is to be expected, since the 
American group covers the whole range of ability, whereas the British group 
is selected (or ‘non-selected’), and also rather younger. In addition the 
British children are seen to be markedly less sociable, more assertive, less 
conscientious, more timid and more tense, these last two factors (O and Q4) 
being the principal indicators of anxiety. In terms of the two second-order 
factors, they are considerably more introverted and more anxious. 

Admittedly, the comparison is not entirely a straightforward one, since 
the British sample is entirely urban and does not contain children from 
secondary grammar or technical schools. But on the first point, it is plain that 
the British sample differs markedly not only from the total American sample, 
but from the American urban sample, and that in every case the differences are 
still in the same direction. 


(c) A comparison between American children and a representative sample of 
British secondary school children. 


This comparison is shown in Table 3. 


TABLE 3 
MEAN SCORES (HSPQ, FORM A ONLY). 
[Table body not reproduced.] 


Perhaps the most striking feature of Table 3 is the confirmation of the very 
marked and highly significant difference already found between British and 
American children on factors E and G. The British children appear as very 
clearly more assertive and less conscientious. The lower sociability of British 
children is also confirmed, though less strikingly. The low score of the British 
children on factor I (sensitivity) is perhaps rather surprising. It is also 
interesting that (unlike the secondary modern group) the representative sample of 
British children does not differ significantly from the American children either 
in extraversion or anxiety. 

(d) The relation between personality factors and school achievement. 

The product-moment correlation coefficients between the fourteen person- 
ality factors and the measures of school achievement are shown in Tables 4-6 
following. 


TABLE 4 

THE RELATION OF SCHOOL ACHIEVEMENT TO FOURTEEN PERSONALITY FACTORS. 
SPRINGFIELD, ILLINOIS. N=153. 
[Table body not reproduced.] 
*—Signif. at 5 per cent. level. 
†—Signif. at 1 per cent. level. 


TABLE 5 

THE RELATION OF SCHOOL ACHIEVEMENT TO FOURTEEN PERSONALITY FACTORS. 
PAXTON, ILLINOIS. N=124. 
[Table body not reproduced.] 
*—Signif. at 5 per cent. level. 
†—Signif. at 1 per cent. level. 


TABLE 6 

THE RELATION OF SCHOOL ACHIEVEMENT TO FOURTEEN PERSONALITY FACTORS. 
MANCHESTER. N=230. 
[Table body not reproduced.] 
*—Signif. at 5 per cent. level. 
†—Signif. at 1 per cent. level. 

The main features of these tables appear to be as follows : 


(1) As in earlier researches, the correlations are not strikingly high. With 
the exception of factor B (intelligence), no factor yields a correlation of over ·4. 
In general they are higher for the Springfield sample than for either the Paxton 
or the Manchester sample. This may very possibly be due to a greater 
standardisation of testing conditions, since the entire Springfield group was 
tested at one sitting in the school hall, whereas the other two groups were 
tested by classes. 

(2) There is some consistent pattern as between the three samples. In 
particular, factors G and Q2 correlate consistently with all measures of 
achievement in all three groups. Of the twenty-two correlations involving factor G 
(conscientiousness), all are positive, and in the American groups the great 
majority are statistically significant. Twenty-one of the twenty-two correlations 
involving factor Q2 (self-sufficiency) are positive, and fourteen of these are 
significant. The role of these two factors is nearly equal. In the Springfield 
sample, both yield correlations of around ·3, in the Paxton sample around ·2, 
in the Manchester sample around ·1. 

(3) There is remarkably little differential prediction of the various 
aspects of achievement. In general, the factors such as G and Q2 which clearly 
predict success, predict it over the whole range of criteria. Similarly, the other 
factors which predict to a lesser degree, such as A (sociability) and E- 
(submissiveness), and also Q4 in the British sample, still do so almost regardless 
of school subject. The teachers’ ratings on behaviour record clearly stand in a 
different category in this respect, but even here intelligence, conscientiousness 
and self-sufficiency appear to play some part. But it is noticeable that Q3 
(self-control), E- (submissiveness), and D- (lack of excitability) also correlate with 
behaviour rating in both samples. These correlations appear to make good 
psychological sense. 

(4) Factor A (sociability) correlates positively (but low) with all criteria in 
both American samples, but yields no significant correlation in the British 
sample. It is possible and plausible that this represents a genuine cultural 
difference, though it is hard to be sure owing to the general attenuation of 
correlations in the British group. This attenuation may be due either to the 
conditions of testing or to the lesser suitability of the questionnaire to British 
children. 
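
The significance markers in Tables 4-6 depend on sample size as well as on the size of the coefficient. The sketch below estimates the smallest correlation reaching the 5 and 1 per cent. levels for the three sample sizes given in the table headings. It assumes a two-tailed test and uses Fisher's z approximation; the paper does not state which procedure the authors used, and the function name `critical_r` is illustrative only.

```python
from math import sqrt, tanh
from statistics import NormalDist


def critical_r(n, alpha):
    """Approximate smallest |r| significant at the two-tailed level alpha.

    Uses Fisher's z transformation (an assumption, not the authors' stated
    method): z = atanh(r) has standard error 1 / sqrt(n - 3).
    """
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    return tanh(z_crit / sqrt(n - 3))


# Sample sizes as printed in the headings of Tables 4, 5 and 6.
samples = [("Springfield", 153), ("Paxton", 124), ("Manchester", 230)]

for name, n in samples:
    r05 = critical_r(n, 0.05)
    r01 = critical_r(n, 0.01)
    print(f"{name:12s} N={n}: 5% level r >= {r05:.3f}, 1% level r >= {r01:.3f}")
```

On this approximation a coefficient of about ·13 is enough for significance at the 5 per cent. level in the largest (Manchester) sample, which is why correlations of around ·1 to ·2 can be statistically significant while remaining of limited predictive value.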


V.—DISCUSSION 


One obvious question comes to mind in connection with the results reported. 
To what extent were the national samples typical of children in the two 
countries ? 

Some further light can be thrown on this issue by a comparison of the results 
obtained for the American children with norms reported in the handbook of 
the test. Although there is naturally some deviation of our figures from these 
norms, inspection suggests that the national differences found may be real ones. 
If one were to substitute the handbook norms in Table 2 for the figures obtained 
from the combined American sample, every difference from the British Second- 
ary Modern children (on the nine factors where significant differences were 
found) would still be in the same direction. To take one example, a striking 
difference was found on factor G (conscientiousness) between the American 
children (M=12·34) and the British Secondary Modern children (M=10·63). 
The corresponding mean score of the American group on whom the test was 
standardised was 12·2. 

Similarly for the comparison between the American children and the 
representative British sample on Form A only. If one substitutes the handbook 
norms on Form A for the mean scores obtained by the American children, the 
differences on the factors where significant differences were found would all be 
in the same direction. Strikingly enough, in the case of the two factors of 
assertion and conscientiousness, where very marked differences were found 
between the two groups, to substitute the handbook norms for the actual 
Illinois results would make these differences larger still. Admittedly the 
average difference in age between the American children and those in the 
second British sample (amounting to some eighteen months) impairs the 
comparison, but the results appear sufficiently clear-cut to indicate some genuine 
differences. 

Although it has not been an aim of this paper directly to contrast secondary 
modern with secondary grammar children, it is interesting to compare our 
findings with those of Hallworth (1961). Hallworth points out that the British 
Psychological Society’s report on Secondary School Selection states fairly 
categorically that there is considerably more worry and anxiety among grammar 
than among modern school pupils, and questions this statement in the 
light of his own findings, suggesting that the opposite may be true. Our own 
results, in which a modern school group were significantly more anxious than 
American children, but in which no such difference was found for a 
representative sample, provide some indirect support for Hallworth's findings. 

Turning to the correlations with measures of school achievement, one finds 
results that are promising and suggestive rather than conclusive. The co- 
efficients of correlation are generally low, though frequently statistically 
significant. To a large extent this may be inherent in the method of the person- 
ality questionnaire which has long been an object of suspicion and of lively 
satirical attack by laymen (Whyte, 1956). A particular deficiency of the 
HSPQ may be the obverse of its merit, the fact that by aiming to measure a 
large number of factors in a comparatively short testing time, it necessarily 
sacrifices some precision on each factor. Thus factor B (intelligence) yields 
lower correlations than would be expected from a full-scale intelligence test. 
The HSPQ was not constructed with the sole aim of predicting school achieve- 
ment, but with the much wider one of providing a general picture of the 
respondent’s personality pattern. It might well be a useful task for future 
research to concentrate on those factors (such as G and Q2) which appear to be 
most directly related to attainment, and to develop fuller and longer scales for 
the specific purpose of predicting attainment. 


VI.—CONCLUSIONS 


(1) In Illinois, U.S.A., urban children were found to be more introverted 
and markedly more anxious than rural children. 

(2) Children at a secondary modern school in Lancashire were more 
introverted and very clearly more anxious than the Illinois children. 

(3) When a second British sample, which covered the whole ability range, 
was compared with the Illinois group, no significant differences were found on 
the factors of extraversion or anxiety. 

(4) The American children were found to be very significantly more 
sociable, less assertive and more conscientious than either British group. 

(5) Apart from intelligence, two of the remaining thirteen factors 
measured by the HSPQ correlate fairly consistently with school attainment. 
These are G (conscientiousness) and Q2 (self-sufficiency). Among the American 
children, factor A (sociability) was the next most consistent predictor. 


SUMMARY. Most of the few tests designed specifically to investigate 
intra-family relationships are based on psycho-analytic concepts. A picture 
projective technique is described, which is based upon the simple theory that a 
child in a friendly environment will draw upon his own experience to describe 
the activities of figures in ambiguous pictures. The pictures have been drawn 
to the authors’ specifications. They show six basic family scenes which are 
replicated three times. This replication provides an internal check on the 
consistency of the replies. The accuracy of this technique was investigated 
by comparing the replies given to the pictures with the information gained 
through psychiatric and other interviews with the families concerned. 


I.—INTRODUCTION 


It is generally agreed that a child's behaviour is largely determined by his 
relationships with his family and, in particular, by his relationships with his 
parents. Hence an assessment of the child's neurotic, or delinquent behaviour 
should include a thorough investigation of his family relationships. For 
young children, a picture projective method of investigation may be used, since 
interviews and techniques requiring written answers are usually impracticable. 
A careful examination of currently available techniques showed that most of 
them were based upon psychoanalytic concepts and were methods of investigat- 
ing personality, Bellak (1950), rather than techniques for assessing inter- 
personal relationships. 

There are, however, five tests designed to explore the relationships between 
the members of a family, but only one of these was a picture test of the 
‘association’ type, Rabin and Haworth (1960). The test devised by Travis 
and Johnston is little more than a simple collection of pictures, Buros (1953). 
The Family Pictures produced by Cummings (1952) consists of a set of faces to 
which the child gives his reactions. Family Drawings by Hulse (1951) is a 
graphic expression of the child's relationships and depends for its success upon 
the child's ability to draw adequate pictures and the examiner's skill in 
interpreting them correctly. The Family Relations Test by Anthony and Bene 
(1957) is a variation of the ‘forced choice’ technique, in so far as the child is 
presented with a number of statements which he must then ascribe to members 
of his family. The other test specifically relating to the family is A Test of 
Family Attitudes, by Jackson (1953). This consists of only eight pictures 
which are drawn with blurred outlines, some of which are highly structured. 
Although the responses to the test are ‘associative,’ there is also a standard 
list of questions for each picture. The test does not appear to be specifically 
based on any particular psychological theory and the author emphasises the 
need for other sources of information to be used in conjunction with the test 
results. 

Whilst each of these tests may be used to assess intra-family relationships, 
it seemed to us that there was room for another technique based upon empirical 
principles which would incorporate features not found in the tests already 
available. Accordingly, the Family Relations Indicator was evolved to assess 
the child's general relationship with his family and in particular, to explore the 
parent-child relationships. The technique is based upon gaining ‘associative’ 
responses to a series of twenty pictures portraying family scenes. The responses 
to the cards are not ‘interpreted’ and it is not necessary for the child to make 
up a ‘story,’ which so many children find difficulty in doing. Short responses 
of only a sentence or two may be quite adequate. The replication of the 
pictures provides a rough internal check on the consistency of the child's replies, 
as well as providing the cumulative effect of a little information from each of 
three pictures. Several of the above features are, so far as we know, either 
found exclusively in our technique, or else they appear in a combination not 
found in other tests. 


II.—PLAN OF INVESTIGATION 


Aim. The aim of the investigation was to determine whether the 
technique we had devised would give a reasonably accurate account of the child's 
relationships with his family. In particular, we wished to discover if it would 
give an accurate picture of the relationships between parents and child. The 
‘association’ method of responding to the pictures was adopted, since this 
allowed the child the widest range of verbal expression, whilst the pictures 
‘channelled’ his replies along the lines of family relationships. 

Theory. It was assumed that if a friendly relationship were established 
between the child and the Examiner, then the child would describe the figures 
in the pictures in terms of his own personal experience. It was considered 
possible that ‘wishful thinking’ and phantasy might distort the descriptions 
of his relationships. In an attempt to minimise this distortion, we decided to 
encourage the child to make statements about the characters, instead of trying 
to compose a story. In this way we hoped that the responses would be factual 
and in accordance with the child’s real frame of reference. We further assumed 
that the child’s responses would reflect the actual home situation, since if a high 
degree of rapport were obtained with the child, he would have little hesitation 
in revealing what he felt about his family. 


Preliminary Investigation. A considerable number of pictures was tried 
out in the preliminary investigation; many of them were taken from books and 
magazines, but some were drawn especially for the purpose. After a considerable 
amount of experiment, it was decided that all the pictures must be specially 
drawn for this investigation and that only six basic family scenes could be 
presented. This initial investigation continued for about three years. We 
were eventually able to secure the services of a professional artist, who drew 
the final set of pictures to our specifications, which were: 
(1) The pictures were to be black and white pencil drawings. 
(2) There was to be as little emotional expression as possible portrayed in 
posture, gesture, or face. 
(3) Clothes were not to be obviously indicative of any particular socio-economic 
class. 
(4) An age range of 8 to 11 years was suggested for the children, with an 
appropriate age for the parental figures. 
(5) Interior scenes were to have a minimum of detail, but sufficient to suggest 
definite rooms. 
(6) Furnishings were to be as socially ambiguous as possible. 
(7) All scenes must be reasonably familiar to children of school age. 
(8) Scenes should be as little ‘structured’ as possible and dramatic postures 
should be avoided. 
(9) The pictures were to be drawn clearly, with fairly sharp outlines; heavy 
shading and blurring of outlines was to be avoided. 
(10) Ambiguity was to be introduced by the arrangement of the figures in the 
pictures, not by shading or by blurring their outlines. 
The drawings were photographed and mounted on twelve-sheet board for 
use during the experimental investigation. A specimen card is shown in Fig. 1. 





Fig. 1 


Material.—The final set of pictures was made up as follows. Three 
introductory cards, showing family groups, were used to acquaint the child 
with the test procedure whilst the main series shows the following basic family 
situations: parents; father and child; mother and child; child alone; a 
sibling group and a child and a baby. Each of these six situations is replicated 
three times, with the figures and the environment being sufficiently altered for 
the child to accept the replications as three different pictures. There is a 
separate series of cards for boys and girls, the only difference between them 
being the sex of the child in the pictures. The other figures and the 
environment are precisely the same in both cases. The three introductory cards are 
presented on each occasion, together with seventeen other cards making a total 
of twenty for each complete test. 

Sample.—The subjects used in this investigation were children referred to 
the Clinic in the ordinary way, and we took fifty consecutive cases, having 
omitted those in which there was some unusual family constellation, or in which 
little was subsequently discovered about the family. The ages of the subjects 
ranged from 4 to 17 years and there were thirty boys and twenty girls. Only 
thirty of the fifty children were tested for intelligence and these had a range of 
I.Qs. from 65 to 136. All the children and their families had been attending 
the Clinic for at least six months, and some of them for as long as two-and-a-half 
years. 

Administration of Test.—The prime requirement for the test is the 
establishment of the maximum rapport between the child and the Examiner. This 
was facilitated when the test was presented on the child's second, or third visit 
to the Clinic. Since this investigation had to be carried out as part of the 
normal clinical routine, it was sometimes necessary to present the test during 
the child's first attendance. In these circumstances extra care was taken to 
ensure the establishment of good rapport. The cards were introduced to the 
child informally, by saying, “I have some pictures here of boys and girls and 
adults (or ‘grown-ups’ if the child is very young). I would like you to have a 
look at them and see if you can tell me what you think they might be doing, 
or saying.” 

The introductory cards were then presented and very often the child would 
say little or nothing about them, and sometimes this reticence would extend to 
the initial pictures of the test series. The child was simply encouraged with 
praise and prompted by means of straightforward questions: What do you 
think they are doing? Where do you think they are going? or What do you 
think they are talking about? No attempt was made to cross-question the 
child, or to present a formal series of questions about each picture. Anything 
the child said was accepted, even if it were only an enumeration of the objects 
in the picture. When this occurred, he was praised for what he had done and 
then it was suggested to him that he might try to think what the people were 
doing. 

The child’s replies were recorded verbatim, sometimes in shorthand and 
occasionally on a tape. Afterwards they were typed in double-spacing to 
facilitate scoring. 


III.—METHOD OF SCORING AND TABULATING THE RESPONSES 


The method of scoring is not mechanical, but depends upon using psycho- 
logical judgment to select from the responses those words and phrases which 
indicate the child's relationships with, or his attitudes towards the members of 
his family. In making a formal investigation of our results we scored the 
responses in the following manner. They were first divided into ‘information 
units’ by means of the solidus, /. These units were sentences, clauses, phrases, 
or even single words. Those which described the personal relationships and 
attitudes of the patient were underlined, and called ‘behavioural units.’ Those 
information units which disclosed neither relationship nor attitude were simply 
ignored. The ‘behavioural units’ were then recorded on the Behaviour Item 
Sheet, which was compiled as follows. From the fifty sets of responses obtained 
during the investigation, we selected twenty at random and analysed them. A 
list was made of all the expressions which were used to describe attitudes and 
behaviour. Some of these were found to be equivalent to one another and the 
redundant expressions were rejected. Forty-nine items were finally selected 
and grouped together as shown in the following list. 


Attitudes. 
1. Likes. 2. Dislikes. 3. Reluctant. 4. Tolerates. 5. Ignores. 
6. Apprehensive. 7. Jealous. 8. Suspicious. 9. Sulky. 10. Sad. 
11. Angry. 12. Cross. 13. Irritable. 14. Apathetic. 

Verbalisations. 
15. Praises. 16. Refuses. 17. Forbids. 18. Rebukes. 19. Requests. 
20. Demands. 21. Defiant. 22. Threatens. 23. Blames/Accuses. 24. Argues. 

Actions. 
25. Initiative. 26. Plays. 27. Shows interest. 28. Helps. 29. Obeys. 
30. Grits. 31. Hits. 32. Fights. 33. Mischief. 34. Injures/Accident. 

Deprivations. 
35. Left behind. 36. Kept in. 37. To bed. 38. No toys. 39. No food. 
40. No money. 41. No playmates. 

Delinquency. 
42. Runs away. 43. Steals. 44. Deceitful. 45. Destructive. 

Guilt Feelings. 
46. Done wrong. 47. Secretive. 48. Apologises. 49. Amends. 

Six columns were placed parallel to these items and headed F (Father) 
M (Mother) B (Boy) G (Girl) Bb (Baby) and Spare. 

The ‘behavioural units’ were recorded by entering on the Behaviour Item 
Sheet the letter F, M, etc. opposite the relevant item and in the appropriate 
column. The letter denoted the object towards whom the behaviour was directed. 
The name at the head of the column in which the letter was placed denoted the 
person who produced that behaviour. If the behaviour was repeated a ‘+’ 
was added. The column headed ‘spare’ was used to record the behaviour of 
anyone extraneous to the family group. 
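
The recording convention just described can be pictured as a small lookup structure. The following is a minimal sketch, not the authors' procedure: the function names `record` and `render` are hypothetical, and it assumes that one ‘+’ is appended for each repetition, which the text does not specify. The column names and item numbers are those given above.

```python
# Columns of the Behaviour Item Sheet: the person who produces the behaviour.
COLUMNS = ("F", "M", "B", "G", "Bb", "Spare")


def record(sheet, item, actor, target):
    """Enter one behavioural unit: `actor` directs behaviour `item` at `target`."""
    if actor not in COLUMNS:
        raise ValueError(f"unknown column: {actor}")
    counts = sheet.setdefault(item, {}).setdefault(actor, {})
    counts[target] = counts.get(target, 0) + 1


def render(sheet, item, actor):
    """Show a cell as it might be written on the sheet, e.g. 'B+'.

    Assumption: one '+' per repetition of the same behaviour.
    """
    counts = sheet.get(item, {}).get(actor, {})
    return " ".join(letter + "+" * (n - 1) for letter, n in counts.items())


sheet = {}
record(sheet, 18, "F", "B")   # item 18 (Rebukes): father rebukes the boy
record(sheet, 18, "F", "B")   # the same behaviour repeated
record(sheet, 31, "F", "B")   # item 31 (Hits): father hits the boy

print(render(sheet, 18, "F"))
print(render(sheet, 31, "F"))
```

Keeping the actor as the column and the target as the cell entry mirrors the sheet itself, so a row can be read across to see who directs a given behaviour at whom.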

If a feeling, attitude or action was expressed, without any indication of an 
object, then the number of the response only was entered on the Behaviour 
Item Sheet. When all the ‘ behavioural units ’ had been entered in this way it 
was possible, by inspection, to determine the kind of relationships which 
existed between the various members of the family. In everyday clinical work 
this elaborate method of scoring is not necessary. After a little practice, the 
psychologist may dispense with the Behaviour Item Sheet and simply make a 
brief summary of the relevant items in the responses. Thus the following 
specimen responses may be summarised as follows : 

(a) Father is verbally hostile to the boy. F.1.2.3. 

(b) Father punishes boy by deprivation F.2.3. and by hitting F.2. 

F.1. He is sitting up the table with Dad / and going to get something. / 
Father comes in, / and father ask what he was doing / and he said nothing. / 
Father said, “You were taking something off the table / and father said, 
go upstairs to-day / and you will have no tea.” / 

F.2. The little boy is standing near the fire / and father says, “What have 
you been doing to-day?” / Why are you standing near the fire? / Did you 
play with it? / I didn’t play with it. / “Yes you were, you were poking it.” / 
His father hit him / and said, “Go up to bed / and come down when I 
tell you.” 

F.3. The little boy has a rocket / and father is reading his paper / and the 
little boy made a pile of books / and put his rocket on / and he is telling 
his father what he has done. / I have put my rocket up / and it is ready to 
fly. / Father said, Put it outside, / don’t do it in here. / The boy took it 
outside / and fired it / and it was lost in the woods / and another boy 
found it / and took it to him. / 
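
The division of a response into information units by the solidus is the only mechanical step in the scoring. A minimal sketch of that step follows, using the opening of specimen response F.3; the function name `information_units` is hypothetical, and the later selection of ‘behavioural units’ is left to the scorer's judgment, as the text requires.

```python
def information_units(response):
    """Split a transcribed response into information units at each solidus."""
    return [unit.strip() for unit in response.split("/") if unit.strip()]


# Opening of specimen response F.3, as marked up by the scorer.
response = (
    "The little boy has a rocket / and father is reading his paper / "
    "and the little boy made a pile of books /"
)

for number, unit in enumerate(information_units(response), start=1):
    print(number, unit)
```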


In order to evaluate the accuracy of our results, it was necessary to code 
the information on the Behaviour Item Sheets, so that it could be handled 
statistically. It is notoriously difficult to code data which consist of inter- 
personal relationships and attitudes. Many scales have been devised, but none 
seemed suited to the data in this investigation. The scaling method used by 
Baldwin, Kalhorn and Breese (1949) was too refined for the data and the 
numerical coding devised by Bene (1957) was too complex. It was finally 
decided to employ a simple ‘ present-absent ’ method of coding using five types 
of relationship which were defined as follows: (1) Punitive action, including
any form of physical violence, or corporal punishment. (2) Verbal hostility,
including scoldings, rebukes, ‘telling off’ and harsh criticisms. (3) Deprivation,
consisting of the denial, or withholding of food, pocket money, pleasures, or
companionship; or being sent to bed. (4) Indifference. This category was used
when there was an indication of a cold, non-punitive, affectionless attitude
between parent and child. (5) Positive Attitude, indicating the presence of a
helpful relationship. Because we were investigating the relationships in
disturbed families, most of the information was entered under the ‘negative’
categories. If a sample of the normal population were being investigated then
more ‘positive’ categories would, no doubt, be required.
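
The present-absent coding just described can be pictured as a small record per parent-child pair. The following is only an illustrative sketch: the identifiers (`CATEGORIES`, `RelationshipCoding`, `mark_present`, `as_row`) are hypothetical names chosen here and are not taken from the Behaviour Item Sheet itself.

```python
from dataclasses import dataclass, field

# The five relationship categories defined in the text (names are hypothetical).
CATEGORIES = (
    "punitive_action",
    "verbal_hostility",
    "deprivation",
    "indifference",
    "positive_attitude",
)


@dataclass
class RelationshipCoding:
    """Present-absent coding of one parent-child relationship."""

    parent: str  # "father" or "mother"
    present: set = field(default_factory=set)

    def mark_present(self, category: str) -> None:
        # Reject anything outside the five defined categories.
        if category not in CATEGORIES:
            raise ValueError(f"unknown category: {category}")
        self.present.add(category)

    def as_row(self) -> dict:
        # One 0/1 entry per category, in a fixed order, ready for tallying.
        return {c: int(c in self.present) for c in CATEGORIES}


coding = RelationshipCoding(parent="father")
coding.mark_present("deprivation")  # e.g. a child sent to bed without tea
row = coding.as_row()
```

Keeping every category in each row, scored 0 or 1, is what allows agreements on absence as well as on presence to be counted later.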


IV.—EVALUATION OF THE F.R.I. RESPONSES 


The information which had been recorded on the Behaviour Item Sheet 
was summarised, using the five categories of relationships just defined. Each 
of these relationships was recorded as existing, or not-existing between mother 
and child and between father and child. A similar assessment was made by 
the psychiatrist from his own knowledge of each family. The accuracy of the
F.R.I. responses was then evaluated by noting the frequency with which the 
statements given by the F.R.I. were substantiated by the psychiatric assess- 
ment. 

This is the usual method of evaluating the results of this type of
investigation, as Foulds (1955) points out. The method has been severely criticised by
Eysenck (1961), and Kreitman et al. (1961) have shown that psychiatric
assessments are not always very reliable. But we had no option but to use this
method, since all the cases had been investigated by one psychiatrist only and 
therefore a corroborative assessment was not available. The accuracy of the 
psychiatric assessment was probably increased by the following factors: 
(1) The families had been thoroughly investigated over a considerable period 
and the assessments were not made on the basis of simply one, or two psychi- 
atric interviews. (2) The information gained by the psychiatrist during his own 
interviews was supplemented by the following sources : social workers’ reports, 
based on interviews with the parents and visits to the home and school ; results 
of play diagnosis ; school reports ; reports from the family doctor and in some 
cases, information gained in interviews with the probation officer, the children’s 
officer, or the welfare worker.

There is, however, one weakness in the psychiatric assessment, which we
could not avoid, since it was impossible to separate our investigation from our 
normal clinical practice. The children replied to the F.R.I. as part of the 
clinical investigation and the summary of their replies was in the psychiatrist's 
possession when he made his assessment of the families for the comparative 
study. It may, therefore, be argued that these results would contaminate the 
psychiatrist's assessment of traits and relationships. In reply to this we may 
say: (i) The F.R.I. results had been in his possession for at least six months, 
so that he would be unlikely to remember them individually. (ii) In making 
his assessment he would not, of course, refer to the psychological report. 
(iii) The other sources of information would be sufficiently weighty to contradict, 
if necessary, the results of the F.R.I. 


The agreements and disagreements between the F.R.I. and the psychiatric 
assessment are shown in Table 1. 


TABLE 1

THE FREQUENCIES WITH WHICH THE VARIOUS RELATIONSHIPS WERE SCORED ‘PRESENT’
AND ‘ABSENT’ BY THE F.R.I. AND THE PSYCHIATRIC ASSESSMENT AND THE AGREEMENTS
BETWEEN THEM.

[Table body not recoverable. Row headings under Father-Child Relationships:
Punitive Action, Verbally hostile, Deprivation, Indifference, Positive Attitude.
Row headings surviving under Mother-Child Relationships: Punitive Action,
Verbally hostile. Column headings: I to V. Surviving cell values, positions
unknown: 18, 2, 5, 22, 19, 66.]

Col. I.—No. of cases in which the F.R.I. and the psychiatric assessment agree
that the relationship is present.

Col. II.—Agreement on the absence of the relationship.

Col. III.—Relationship scored ‘present’ by the psychiatric assessment only.

Col. IV.—Relationship scored ‘present’ by the F.R.I. only.

Col. V.—Ratio between Col. I and Cols. I + IV.

By using Table 1 we can estimate the extent to which the F.R.I. responses
provide accurate information about family relationships. It is clear that the
F.R.I. does not provide all the available information about parent-child
relationships in the specified categories, nor should we expect it to do so. It is unlikely
that the result of an investigation lasting half-an-hour (the time usually required 
to administer the F.R.I.) would provide as much information as an investigation 
extending over several months or even years. The difference between the 
amount of correct information provided by the F.R.I. and the total information 
available is indicated by Col. III. These results might be criticised on the grounds 
that this difference is large and that the overall agreements are little better than 
chance. But such a criticism would not invalidate the fact that the inform- 
ation which the F.R.I. has actually provided is fairly accurate as shown in 
Col. V of Table 1. So far as we can discover, no one has attempted to show that 
a picture projection method will produce all the theoretically available inform- 
ation either about individuals, or groups. 

Some authors like Jackson (1950) and Walton (1959) have used the method 
to discriminate between nosological groups of children. Others have used this 
technique to provide a description of the subject’s personality or attitudes
(Rosenzweig, 1948; Phillipson, 1955; Hartwell et al., 1953).

We have attempted to show that the information produced by the F.R.I. 
describes the relationships which actually exist within the family. Since these 
relationships may be independently assessed with reasonable accuracy, this 
assessment forms an ‘objective’ standard (English, 1958) against which the
accuracy of the F.R.I. results may be measured. In order to do this we have
compared the total amount of information produced by the F.R.I. with the extent
to which this information was verified by the psychiatric assessment. The accuracy 
of the F.R.I. is represented by the percentage ratios given in Col. V of Table 1. 
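
As a rough worked example of this percentage ratio, the sketch below assumes that Col. V is Col. I divided by the sum of Cols. I and IV, that is, the share of F.R.I. ‘present’ scores which the psychiatric assessment confirmed. The function name `accuracy_ratio` is hypothetical. The figures are those the Discussion reports for the ‘positive’ father-child relationship: seventeen cases according to the F.R.I., of which five were supported.

```python
def accuracy_ratio(agree_present: int, fri_only: int) -> float:
    """Percentage of F.R.I. 'present' scores confirmed by the psychiatrist.

    agree_present -- Col. I  (both sources score the relationship present)
    fri_only      -- Col. IV (only the F.R.I. scores it present)
    Assumes Col. V = Col. I / (Col. I + Col. IV), expressed as a percentage.
    """
    total_fri = agree_present + fri_only
    if total_fri == 0:
        raise ValueError("the F.R.I. scored no case 'present'")
    return 100.0 * agree_present / total_fri


# 'Positive' father-child relationship: 17 F.R.I. cases, 5 of them supported,
# so 12 were scored present by the F.R.I. only.
positive_father = accuracy_ratio(agree_present=5, fri_only=12)
```

On these figures the ratio is a little under 30 per cent., which is the gross discrepancy the Discussion attributes to wish fulfilment.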

In addition to parent-child relationships, a considerable amount of inform- 
ation concerning the child’s own attitudes and feelings was given in the F.R.I. 
responses. The nature and accuracy of this information is shown in Table 2.

Other information of clinical value was sometimes given but not with 
sufficient regularity to make a further evaluation worthwhile. Occasionally 
references to marital difficulties, sibling, or school problems provided valuable 
‘ clinical leads ’ to be investigated during subsequent interviews. 


TABLE 2

SHOWING THE EXTENT TO WHICH THE FINDINGS OF THE F.R.I. WERE SUPPORTED BY THE
PSYCHIATRIC ASSESSMENT WITH RESPECT TO THE SPECIFIED TRAITS SHOWN BY THE CHILD.

[Table body not recoverable. Surviving row headings: Verbally aggressive;
“reme aggressive” (garbled); Anti-social.]

Col. I.—Frequency with which a given trait was present according to the F.R.I.

Col. II.—Frequency with which the F.R.I. result was supported by the psychiatric
assessment.

Col. III.—Percentage agreement between Cols. I and II.

No attempt was made to assess the reliability of the F.R.I. This is a
notoriously difficult undertaking with projective techniques, since comparable 
sets of duplicate pictures are almost impossible to produce and any appreciable 
time interval between test and re-test might vitiate the reliability coefficient, 
since appreciable changes in the child’s attitudes, or relationships, may have 
occurred during that interval. The ‘ split-half’ technique might be employed 
in a similar manner to that adopted by Anthony and Bene (1957). But we have 
already incorporated a modification of this principle in our method of scoring, 
since a relationship was not recorded as ‘ present’ unless it was clearly men- 
tioned in at least two separate responses. 
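
The scoring rule in the last sentence, under which a relationship counts as ‘present’ only when at least two separate responses mention it, might be sketched as follows. The function name `score_present`, the category labels and the three sample responses are all hypothetical and serve only to illustrate the rule.

```python
from collections import Counter


def score_present(responses, min_mentions=2):
    """Return the categories mentioned in at least `min_mentions` responses.

    responses -- one collection of category labels per card response
    """
    # Count each category once per response, however often it recurs within it.
    counts = Counter(cat for resp in responses for cat in set(resp))
    return {cat for cat, n in counts.items() if n >= min_mentions}


# Hypothetical codings of three card responses from one child.
responses = [
    {"deprivation", "verbal_hostility"},
    {"deprivation"},
    {"positive_attitude"},
]
present = score_present(responses)
```

Here only deprivation is scored ‘present’, because the other two categories each appear in a single response.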


V.—DISCUSSION AND CONCLUSION 


The Family Relations Indicator provides information about three sectors 
of family life. These are, the relationships between the child and father, the 
child and mother, and certain personal characteristics of the child itself. The 
types of relationships and traits which have been evaluated were not deduced 
from a priori principles, but were empirically derived from the results of a 
preliminary investigation. When the full scale investigation was carried out, 
no other major relationships, or traits, appeared frequently enough in the responses
to warrant their addition to our original list. The preponderance of ‘ negative ’ 
relationships and traits in our results is due to the fact that the investigation 
was concerned only with emotionally disturbed children, who had been referred 
as patients to the Department. Naturally, not all these patients were equally 
disturbed, nor did they all show the same kind, or number of disturbed relation- 
ships. 

It is possible that some relationships did not appear in the responses 
because they were too painful to describe. This would agree with the theory of 
the repression of painful experiences, but we were not able to investigate this 
point. 

An inspection of Table 1 shows that the ‘positive’ relationship between
father and child occurs in seventeen cases according to the F.R.I., but only five 
of these are supported by the psychiatric assessment. It is possible that this 
discrepancy may be due to wish fulfilment on the part of the child, who would 
like father to show him more interest and attention but, in fact, father does 
not do so. The existence of a gross discrepancy between the two findings on 
this relationship and a fairly high agreement between them on all the other 
relationships, tends to support this view, but there is no independent
confirmation of it.

The percentage agreement for the mother-child relationships is appreciably 
greater than that for the father-child relationships. This may be due to the 
fact that most psychiatric investigations tend to be mother-child centred, with 
a relative neglect of father’s relationship to the child. Again, there is no 
independent confirmation of this view. 


Fantasy, which is often associated with children’s projection responses, 
very seldom appeared in the replies we obtained. Fantasy, in the sense of an 
‘ imaginative construction,’ as opposed to wishful thinking, is easily detected 
when present, because the activity described is clearly impossible. Thus, a boy 
of 5 years 10 months replies to card M.2, “He is going to tickle Dad and then
tear that book. Tear that book up and kick Dad in the tummy and get a
spear and kill him.”
In reply to card No. S.1, a girl aged 6 years 3 months says, “They are
dancing and he kicks up his foot and hits him under the jaw and kills him.”
In spite of the fantasy elements in these replies the nature of the relationship 
disclosed seems fairly clear and we scored them as if they had been purely factual 
descriptions. This type of reply occurred only five times in the fifty sets of 
responses. Once a dream was included in a reply and in four other replies the 
influence of adventure stories seemed clear. Thus, a boy aged 10 years says, 
“The boy has a gang, and he’s thinking about a plan and he dug a tunnel from 
one end of the playing field to the other. They made a box like that used by 
the men in the Horse of Troy who were escaping." However, the vast majority 
of replies were free from fantasy elements, and they could not have contained 
much wishful thinking, or the overall accuracy of the replies would not have 
been so great. 

Responses to individual cards seldom exceeded six lines of typing on 
quarto paper and although this is quite short for a projection response, this 
defect was overcome by the replication of the pictures. 

The traits listed in Table 2 were deduced directly from the replies, except 
that ‘ guilt feelings ' were inferred from the secretive, or self-punitive actions 
of the child. 

We have found in subsequent clinical work that it is not wise to try to 
extract too much information from the responses. They usually show the 
parent-child relationships quite clearly and frequently disclose the attitude of
the child towards other members of the family. In addition, the responses 
often provide useful information in the form of ‘ clinical leads,’ which, although 
not scoreable in the manner we have described, might well be included in the 
psychologist’s report interpreting the results. Thus, the tone of voice used in 
a reply, or a dramatic gesture may be very illuminating clinically, but impossible 
to score adequately. 

We have encountered several difficulties in interpreting the responses given 
by the children when their family relationships were complicated by removal 
into foster care, the death of a parent, the presence in the home of an aunt, 
grandparent(s) or other relations and the presence of a much older brother, or 
sister, who could conceivably act as a parent figure. These complicated 
situations were excluded from the reported results, as there was no means of 
telling from the responses, whether the child was speaking of his actual parent, 
even if the parent had died, or whether he was speaking of a parent-substitute 
in the form of aunt, elder sibling, or grandparent. When a child gives a set of 
responses in which the characters are not clearly identified, we regard what he 
says about them as applicable to anyone within the family group and follow up 
the ‘leads’ given, in subsequent interviews. We have never used a set of 
formal questions for eliciting information, like those given by Jackson (1952). 


We endeavour to make the maximum rapport with the child and never 
exert pressure to obtain information. It is possible that we might sometimes 
gain more information by systematic questioning, but we feel that information 
can be obtained at too high a price and that it is unwise to press the child for 
information that he will not freely supply, since such pressure would lead to a 
loss of rapport between the child and the examiner. 


It seems clear to us that a projective technique used with young children, 
will sometimes provide information which could not be otherwise obtained.
It provides an assessment of personal and family relationships, which could be
utilised for diagnostic, or therapeutic purposes. The projection method is also
valuable, because it often yields information quickly, and provides valuable
clinical leads at the first, or second interview. The technique we have described
fulfils these functions over a limited area, but does not pretend to give extensive 
personality pictures, or an exhaustive account of inter-personal relationships.


THE RELIABILITY AND VALIDITY OF THE
GOODENOUGH DRAW-A-MAN TEST


A PILOT LONGITUDINAL STUDY 


By M. L. KELLMER PRINGLE and K. T. PICKUP
(University of Birmingham, Institute of Education*; Huddersfield 
Educational Guidance Service) 


Summary. The findings reported here—which are part of a much larger 
longitudinal project—deal with the drawings of a man made by thirty-seven 
children at yearly intervals throughout their junior school career, i.e., between 
the ages of 7 to 10 years. A total of 148 drawings was analysed according to 
Goodenough’s instructions. The intelligence quotients obtained from the
drawings were intercorrelated: (a) with each other; (b) with the Stanford-Binet
Intelligence Quotients; and (c) with W.I.S.C. Verbal, Performance and
Full Scale Quotients. An analysis of the fifty-one items of the Goodenough 
Scale was also carried out to investigate whether or not there is a regular, 
annual increase in scores over the four-year period. 

The findings suggest : 

(1) That for junior school children the longitudinal test-retest reliability of the
Goodenough Scale is too low for it to be regarded as a stable measure of
intelligence.
(2) That the Scale's validity is even lower, average correlations with quotients
obtained from the Stanford-Binet Scale as well as from the W.I.S.C. Verbal,
Performance and Full Scale being in the region of .2.
(3) That its discriminating power between successive years is not entirely
satisfactory.


I.—INTRODUCTION


STUDENTS of child development have for long been interested in the apparent 
correspondence which exists between children’s maturing intellect on the one 
hand and the progressively greater representation of detail and proportion 
which is shown in their drawings of a man on the other (Cooke, 1885 ; Maitland, 
1895; Partridge, 1902; Kerschensteiner, 1905 ; Lobsien, 1905 ; Ivanoff, 1909; 
Katzaroff, 1910; Rouma, 1913; Luquet, 1913; Zesbaugh, 1934). The first 
attempt to establish age norms was made almost 60 years ago (Shuyten, 1904). 
In devising a scale to assess intelligence by means of children’s drawing of a 
man, Goodenough’s (1926) approach differed from that of previous investi- 
gators in the following respects: (a) no a priori decisions were made as to 
features characterising intellectual ability ; (b) artistic standards were dis- 
regarded ; (c) efforts were made to eliminate subjective judgments as far as 
possible; (d) normative validity was derived from both chronological age and 
educational status ; (e) a standard subject for drawing was chosen, but no other 
limitations were imposed. In all, five revisions of the scale were made, a differ- 
ent set of drawings being used for each revision to avoid the error of validating 
a point by means of the same drawings from which it had been derived. The 
scale now consists of fifty-one items, the norms being based upon the drawings 
of 2,306 children aged from four to ten years. 

Since the publication of Goodenough's study numerous investigations have
been carried out using the Draw-a-Man Scale. Ansbacher (1952) studying 100
fourth grade children whose mean age was ten years, found that it correlated
most highly with reasoning, spatial, perceptual factors and ‘tracing’ (“a test
loaded with youthful willingness to follow directions”). It had least in
common with verbal meaning and number. Stewart (1953) concluded “ that 
it compares favourably in test-retest reliability with most group tests of 
intelligence applicable in the same age range. It also compares favourably in 
validity, as demonstrated on the basis of its correlations with the Stanford- 
Binet Test within the age groups for which it was designed, while yet possessing 
the advantage of being non-verbal in character." However, Stewart maintains 
that this instrument ought to be kept fully up-to-date; also that certain 
weaknesses need to be eliminated, such as unduly high norms above the ten 
year level and the fact that the present scoring system penalises children who 
draw full-face rather than profile figures. 


Yague and Agullo (1959) studied the drawings of a man made by fifty-seven 
9-year-old children. They obtained a test-retest reliability coefficient of .846
which they considered to be too low for individual prediction. Validity
coefficients between the Goodenough on the one hand and the Cattell Test I and II
and Otis on the other hand, were respectively .25, .12 and .38. They also
concluded that the Goodenough Scale has little value in predicting school 
success or intelligence. Springer (1941) made an item analysis of the scale to 
ascertain which items differentiate between maladjusted and well-adjusted 
boys (as classified by teachers’ ratings on the Haggerty-Olson-Wickman 
Behaviour Rating Schedules). Of the fifty-one items of the Draw-a-Man Test, 
fifteen showed critical ratios of 3.00 or higher; this was considered to be
“indicative of a significantly reliable difference between the two groups that
are compared.” On nine of the fifteen items the maladjusted were more
successful, and on the remaining six the adjusted achieved greater success. “The
adjusted tend to be more successful in those items which involve motor co- 
ordination and correct proportions of arms and legs to the trunk. The mal- 
adjusted tend to include more detail in their drawings in regard to the parts of 
body and clothing.” However, the maladjusted also showed more incongruities, 
especially confusions between full-face and profile presentation, and a poor 
sense of proportion. It is interesting to note that a somewhat similar pattern 
of differences was found in comparisons between normal and retarded children 
(McElwee 1932 and 1934, Israelite 1936, Spoerl 1940, and Goodenough 1950). 
Merguet (1958), having administered the scale to 1,042 Child Guidance cases 
ranging in age from 3 to 14 years, concluded that it is particularly useful in 
testing the ability of children of normal and sub-normal intelligence ; whereas 
with children of high ability it is more meaningful as a projective technique. 
In Vernon's view (1960) “ the published norms appear to apply pretty accurately 
to British children," though the reliability of re-tests is rather unsatisfactory. 


The Goodenough Draw-a-Man and the Stanford-Binet Scale. 

Goodenough (1926) found that for children aged from 4 to 10 years, 
correlation coefficients between mental age as measured by her scale and by 
the Stanford-Binet Intelligence Test ranged from .56 to .86. Williams (1935)
arranged for the drawings of 100 children aged 3-15 years to be scored
independently by five examiners. Correlations between examiners ranged from
.8 to .96 and of the relationship between the Stanford-Binet and Goodenough
test results, Williams stated: “While, in general, the agreement is close
enough for practical purposes, extreme deviations, often of clinical significance,
may be expected.”

Using both Forms L and M of the Stanford-Binet Test, McHugh (1945)
studied their relationship with the Goodenough Scale with a group of ninety 
children whose mean age was 6 years and 4 months with a standard deviation 
of 3.97 months. A significant correlation of .45 was found between the mental
ages obtained on the Stanford-Binet and Goodenough tests and the correlation
between the intelligence quotients was .41. Higher correlations might well
have been obtained had only one form of the Stanford-Binet Test been used.
With thirty-seven feeble-minded boys aged 9-18 years, Yepsen (1929) obtained
a correlation of .6.


Birch (1949) compared the Stanford-Binet and Goodenough mental ages
of a group of sixty-eight borderline and mentally defective children between the
ages of 10½ years and 16 years 3 months, obtaining a validity coefficient of .62.
He concluded that the Draw-a-Man test is a valid measure of mental ability in
children within the age range studied whose Stanford-Binet I.Qs. are 7 
below. 


Brill (1937) found that maladjusted adolescents tended to score lower on 
the Goodenough than on the Stanford-Binet Scale, whereas the reverse obtained 
for most well-adjusted subjects. “Thus a mentally defective boy who scores
approximately two years or more higher on the Goodenough than is expected 
by reason of his Binet general mental level belongs most likely to the adjusted 
group. On the other hand, a mentally defective boy who scores approximately 
two or more years lower on the Goodenough than on the Binet belongs most 
likely to the maladjusted group." Hinrichs (1935) compared the Draw-a-Man 
performance of delinquent and non-delinquent subjects aged from 9 to 18 years, 
whose I.Qs. were 80 and above on the Terman. As most of them were above 
the age for which the Goodenough scale had been standardised, Hinrichs 
devised an upward extension but found this to be neither as reliable nor as 
valid as the part designed for younger children. The subjects were eighty-one 
boys living in an institution for delinquents who were matched with
non-delinquent boys for chronological age and home background. The delinquents
were found to have lower I.Qs. on the Goodenough than on the Binet Scale. 

Using the two tests with a small group of children suffering from post- 
encephalitic behaviour disorders, Bender (1940) found that the I.Qs. obtained 
from the drawings were far below those resulting from the Stanford-Binet Test. 
This she interpreted as being due to a considerable disruption of the body 
image caused by the cerebral damage and suggested that a marked discrepancy
between the two measures may indicate the possibility of some brain damage. 


The Goodenough Draw-a-Man and the Wechsler Intelligence Scale for Children.


Though the Goodenough Scale was published more than thirty years ago, 
Sundberg (1961) found that it vies in popularity with the W.I.S.C. which was 
published only thirteen years ago. Indeed, among 185 testing services in the 
United States, it seems to occupy the highest position of any intelligence test in 
clinical use except the Stanford-Binet Scale. Hanvik (1953) gave the Good- 
enough and W.I.S.C. scales to twenty-five child psychiatric patients whose ages 
ranged from 5 to 12 years. The group obtained significantly lower I.Qs. on the 
Draw-a-Man test while the W.I.S.C. quotient fell below or equalled the
Goodenough score in only four cases. The rank order correlation between the
quotients obtained with the two scales was .18. Hanvik concluded that
emotionally disturbed children do not draw the human figure in a way which
is commensurate with their ability as measured by the W.I.S.C.


The Draw-a-Man test used with different age groups and longitudinally.


Oakley (1940) observed that with the coming of adolescence drawings of a
man undergo considerable change and there may even appear to be a deteriora- 
tion in quality ; for example, lines are drawn indecisively and parts, which are 
difficult to draw, such as the hands, are sketchily treated or omitted. But 
although some rather obvious features are lost and there is a deterioration in 
the logical arrangement, this may be offset by a subtlety which was absent 
previously. In Oakley’s view these changes come about through the child’s 
losing confidence because he can now perceive how imperfect his drawings are. 

Djukic (1953) examined 900 drawings made by boys and girls, ranging in 
age from 3 to 10 years, to determine whether there is any developmental 
sequence in the manner in which the various parts of the human figure appear 
with increasing age. He found that every part of the face or body seems to 
evolve independently of every other part ; moreover, the stage of development 
reached in representing one part does not necessarily correspond to the stage 
reached in representing other parts. 

The earliest recorded longitudinal study is that of Luquet (1913), who 
kept some 1,700 drawings made by his daughter between the ages of 3 to 8 
years. Care was taken to keep her drawings as free as possible from the 
influence of adults and other children. He found that when a new characteristic 
appeared it did not become ‘ fixed’ at once but over a considerable period of 
time would disappear and reappear in the drawing of the figure. McCurdy 
(1947) selected fifty-six drawings from a collection made by a child between the 
ages of 2½ to 7 years. The average Goodenough I.Q. was 171 with a standard
deviation of 22. He also compared the drawings of a man made by 7-year-old 
children on two occasions which were separated by a three months interval ; 
there were fifty-nine children in the sample. The correlation between the two 
sets of drawings was .69, and the difference between the mean intelligence
quotients of the two samples was not significant. From these results McCurdy 
argued that variations in Goodenough scores rest upon fluctuation in perform- 
ance and may not be due merely to an error of measurement: “ The behaviour 
measured by the Binet test apparently has greater stability than that measured 
by the Goodenough test." He also concluded that the variability exhibited both 
by the individual and by the group is of about the same magnitude. 

Ochs (1950) studied the drawings of a man made by 120 maladjusted 
children (eighty boys and forty girls) who had received treatment in a Children's 
Psychiatric In-Patient Unit. Their ages ranged from 5 to 13 years. The aim 
was to determine “whether improvement in social adjustment in a particular
child was paralleled by increase in Goodenough drawing I.Qs. as might be
expected on the basis of clinical and experimental findings.” Retests were
carried out between five and thirty-six months, the mean interval being 
fourteen months. It was found that Goodenough I.Qs. tended to increase with 
better adjustment and to decrease with continued maladjustment. There was 
little consistency regarding addition or omission of scoring items on re-tests as 
compared to initial test ; similarly, there were no consistent changes in the 
quality of the drawings. Ochs concluded that the correlation between improved 
social adjustment and drawing score is positive but only moderately high. 


II.—DESIGN OF THE STUDY


So far there has been no attempt to investigate the reliability of the
Goodenough Scale longitudinally or to validate it longitudinally against some
accepted individual test of intelligence, such as the Stanford-Binet Scale or the
Wechsler Intelligence Scale for Children. The results reported here form part
of a much larger longitudinal study concerned with various aspects of children’s 
intellectual, emotional and educational development during the junior school 
years ; eventually data will be available for some 230 children. 

In this pilot project the sample consists of thirty-seven children, twenty-one 
boys and sixteen girls, each of whom drew a man at yearly intervals throughout 
the four year period of their junior school career. This group was in no way 
selected but simply consisted of the children for whom the necessary intelligence 
test results were available at the time, namely early in the second term of their
last year in the junior school. The school from which the sample was drawn 
serves a post-war local authority housing estate, thus consisting largely of 
working-class families with a sprinkling of professional people. In the first three 
years at this school, children are placed in unstreamed classes, but in the final 
year they are streamed according to level of attainment. Our sample was 
drawn fairly evenly from the four streams, consisting of ten A stream, seven 
B stream, eleven C stream and nine D stream pupils. 

A total of 148 drawings, made when the children were 7, 8, 9 and 10 years 
old, respectively, was analysed. The intelligence quotients calculated for each 
of these (according to the instructions and norms given by Goodenough, 1926) 
were intercorrelated (a) with each other, (b) with the 1937 Stanford-Binet 
Intelligence quotients and (c) with the W.I.S.C. Verbal, Performance and Full 
Scale quotients. An item analysis of the fifty-one items of the Scale was also 
carried out to investigate whether or not there is a smooth, overall increase in 
scores over the four year period. Lastly, to compare the scores obtained by our 
experimental sample with those made by the large standardisation samples of 
the Stanford-Binet Test and the W.I.S.C., coefficients of correlation were 
calculated between scores obtained on the Stanford-Binet and the W.I.S.C. 
Verbal, Performance and Full Scale tests respectively ; Full Scale scores were 
also correlated with Verbal and Performance Scale scores. The product-moment 
method of correlation was used in all cases. 
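
For readers unfamiliar with the product-moment method, the sketch below shows the calculation in its plainest form. The function name `pearson_r` is hypothetical, and the two short lists of quotients are invented purely for illustration; they are not data from this study.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equal-length lists of scores."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length with at least 2 scores")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of squares and cross-products of deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when one list is constant")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical quotients for five children, for illustration only.
draw_a_man = [95, 102, 110, 88, 120]
stanford_binet = [100, 98, 115, 92, 118]
r = pearson_r(draw_a_man, stanford_binet)
```

The same function would serve for every pairing described above, since each comparison is simply between two lists of quotients for the same thirty-seven children.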


III.—RESULTS


There was a low degree of inter-correlation between the intelligence 
quotients derived from the drawings of a man made at yearly intervals over the 
four-year junior school period; the highest correlation (.53) existed between the 
second and third year drawings and the lowest between those made in the 
first and second year (.22). This finding suggests that the longitudinal 
test-retest reliability of the scale is low, the average over the four year period being 
in the region of .4 (Table 1). Levels of significance reaching the 5 per cent. or 
1 per cent. level of confidence are indicated in the Tables. 


TABLE 1 


INTER-CORRELATED I.Q. SCORES OF THIRTY-SEVEN DRAWINGS OF A MAN MADE IN THE 
FIRST, SECOND, THIRD AND FOURTH YEAR OF THE JUNIOR SCHOOL. 


                DAM 3          DAM 4 
                .413 (5%)      .352 
                .534 (1%) 


* DAM 1—Drawing of a Man, 1st year, Junior School. 


Means and standard deviations are shown in Table 2. All the obtained 
means exceed 100, which, for an urban sample such as this, is in accordance 
with other findings. The mean and standard deviation of the Goodenough are 
lowest for the fourth year in the junior school. A mean I.Q. greater than 100 
for the Stanford-Binet is also in line with other results reported for British 
children (Scottish Survey, 1949, Heasman 1955, Marshak 1956, Jones 1962). 
Similarly, the fact that the mean I.Qs. for all scales of the W.I.S.C. are above 
100 but below the Stanford-Binet mean, is in line with recent findings in this 
country (Jones 1962). The standard deviations for both the W.I.S.C. and 
Stanford-Binet quotients are somewhat lower than those given for the standard- 
isation samples at corresponding ages (Wechsler 1949 and Terman 1937). 


TABLE 2 


THE MEAN AND S.D. OF I.Qs. ON THE GOODENOUGH, W.I.S.C. AND STANFORD-BINET SCALES 





Draw-a-Man, 1st year 
Draw-a-Man, 2nd year 
Draw-a-Man, 3rd year 





The inter-correlations between the quotients obtained from the drawings 
of a man and the W.I.S.C. Verbal, Performance and Full Scale were even lower, 
ranging from .4 to .01. They were significant (at the 5 per cent. level) only 
for the first year. There is a trend for the Goodenough quotients to correlate 
highest with the W.I.S.C. Full Scale quotients; also correlations between 
Goodenough quotients and all three W.I.S.C. quotients are highest in the first 
year and lowest in the fourth year of the junior school. A very similar pattern 
obtained regarding the correlation between Stanford-Binet and Goodenough 
quotients : inter-correlations were low throughout but highest in the first year 
(.37) and lowest in the fourth year (.12) of the junior school (Table 3). Thus 
the longitudinal validity of the Goodenough Scale seems to be low in relation to 
both the W.I.S.C. and Stanford-Binet Scales. 


TABLE 3 


CORRELATIONS BETWEEN GOODENOUGH, W.I.S.C. AND STANFORD-BINET (1937). 


To investigate the comparability of the W.I.S.C. and Stanford-Binet 
intelligence quotients obtained by our sample with those of other validating 
studies, inter- and self-correlations were calculated. Correlation coefficients were 
found to range from .59 to .85, the lower obtaining between W.I.S.C. Performance 
and Stanford-Binet quotients and the higher between W.I.S.C. Full Scale 
and Verbal Scale quotients (Table 4). These results are in accordance with 
other studies dealing with the relationship between the W.I.S.C. and Stanford-Binet 
Scales. Thus they suggest that our sample is comparable to the larger 
samples which have been used for validating purposes, where correlation 
coefficients ranged from .6 to .9 (N. D. Sundberg in Buros, 1959). 


TABLE 4 


INTER- AND SELF-CORRELATIONS BETWEEN W.I.S.C. AND STANFORD-BINET. 


vs. S-B 
. (P) vs. S-B 


ES) vs. (9) 


An analysis of the fifty-one items of the Goodenough Scale indicated that 
there is not an overall tendency towards a regular annual increase. It would 
seem that with increasing age there is either a decrease in interest or in attention 
to detail. As the sample progressed through the junior school, there was a 
gradual falling off in the number of children showing steep increases in scores ; 
this trend was most marked between the third and fourth year and least marked 
between the first and second year. Over the years there was also an increase in 
the number of children showing a deterioration in scores ; this trend was again 
least marked between the first and second year and most marked between the 
third and fourth year. However, these tendencies did not affect the overall 
pattern, according to which the number of items showing an annual 
decrease was much smaller than the number showing yearly increases. 


vs. S-B .... .773 (1%) 





IV.—DISCUSSION AND CONCLUSIONS 


The findings of this pilot study suggest that for junior school children the 
longitudinal test-retest reliability of the Goodenough Draw-a-Man Scale (around .4) 
is too low for it to be regarded as a stable measure of intelligence. Its 
validity is even lower. For each of the four years, average correlations with 
intelligence quotients obtained from the Verbal, Performance and Full Scale 
W.I.S.C., as well as from the Stanford-Binet Scale, were within the region of 
.2 to .3. Hence, it would seem that the Goodenough Scale is both invalid and 
unreliable as a measure of intelligence for children aged seven to ten years. 


In horizontal studies of drawings of a man made by children aged 7 to 11 
years, Goodenough (1926) found average correlation coefficients of .8 for 
validity and .9 for reliability. More recent studies, using different methods of 
correlation, suggest validity to be of the order of .4 and reliability to be around 
.8. Though these figures are lower than Goodenough's, they are still higher than 
those found in the present study. There are at least three possible hypotheses to 
account for these rather divergent findings: first, that the samples employed, 
both in this and in most of the other studies, were too small to yield statistically 
meaningful results ; secondly, that the norms have become out of date, making 
a re-standardisation of the scale necessary; or thirdly, that a longitudinal 
study is not statistically comparable to a horizontal one. Of course, these 
hypotheses are not mutually exclusive. When the data for our larger longi- 
tudinal study become available, they may go some way towards substantiating 
or disproving these hypotheses. 


Our findings suggest that during the junior school period the Scale's 
discriminating power between successive years may not be entirely satisfactory. 
Some early observations seem to be borne out by our data, namely that of 
Luquet (1913), that new features shown in a child's drawing of a man do not 
become ‘fixed’ but may disappear and reappear over a period of time, and 
that of Oakley (1940) regarding some deterioration in the quality of figure 
drawings with the onset of adolescence. Nowadays quite a few children are 
approaching adolescence towards the end of their junior school career. Thirdly, 
the loss in popularity of the human figure as a subject for children's drawings 
towards the end of the junior school stage was first commented upon by 
Maitland (1895). Studying 1,570 drawings of children who were asked to draw 
a subject of their own choice, she found that a human being was chosen by 45 
per cent. of children aged between 5 and 7 years, by 40 per cent. of those between 
8 and 10 years, by 12 per cent. of 11 to 13-year-olds and by only 5 per cent. of 
children who were between the ages of 14 to 17 years. In our study this in- 
creasing loss in popularity showed itself in the children's overt attitude to the 
task: during the first two years of this longitudinal study they settled quite 
happily to drawing a man, but during the third and, even more markedly, the 
fourth year they showed many unmistakable signs of dislike : there were sighs, 
groans and also delays before the drawing was started ; quite a few of the 
children insisted on hiding their ‘man’ from their neighbours as well as from 
the investigator (M.L.K.P.) whom by that time they knew very well. 


When the Stanford-Binet and W.I.S.C. are used as validating yardsticks, 
it is clear that the Goodenough Scale does not yield comparable estimates of 
intelligence for children aged between seven and ten years. However, in the 
absence of a generally agreed definition of intelligence, one could argue that the 
Goodenough taps different aspects of ability from those assessed by means of 
the Stanford-Binet or W.I.S.C., especially as it is a non-verbal test. Ansbacher’s 
(1952) findings lend some support to this view, as do some of the studies dealing 
with mentally subnormal, maladjusted, delinquent and otherwise handicapped 
children. Moreover, the Scale’s greatest usefulness may well lie with younger 
age groups. What seems needed is a normative and longitudinal investigation 
with British children aged between 3 and 7 years. 


A FURTHER STUDY OF PROGRESSIVE MATRICES (1947) 


By AUBREY J. YATES 
(University of Western Australia)* 


Summary. Two groups of university students were allowed sixty minutes for 
Progressive Matrices (1947), indicating the point reached after forty minutes. 
One group began at item No. 1, the other group at item No. 21. Analysis of 
the data indicated that for about 13 per cent. of the sample, the score after 
forty minutes markedly underestimates the level the subject would be able to 
reach if given longer time. These subjects make few errors within the forty 
minute time limit, but attempt many fewer items than faster working students. 
It is suggested that records be scrutinized for such patterns and additional 
time allowed where necessary. 


I.—INTRODUCTION. 


In a previous study (Yates, 1961), an item analysis of Progressive Matrices 
(1947) was carried out on the scores of two groups of university students. While 
the test discriminated very well within the groups, it was also shown that : 


(i) Many of the first twenty items were of negligible discriminatory value; 
the middle twenty items discriminated best; the last eight items were 
attempted by very few subjects. 


(ii) The increase in difficulty level apparently showed a sharp upward trend 
after item 35. However, since few subjects attempted the final items, their 
true discriminatory value could not be assessed. 


(iii) Comparison of individual scores was rendered difficult by the fact that 
individuals obtaining the same score had often attempted a differing 
number of problems. 


Consequently, it was suggested that the test might be improved if the first 
twenty items were omitted (while retaining the practice set) and the time limit 
increased from forty to sixty minutes. The present study investigated the 
validity of these suggestions. 


II.—PROCEDURE. 


The samples were drawn from a complete class of 142 external students in 
residence at the University of New England during the second vacation of the 
academic year. For practical sessions, they were split into two approximately 
equal groups on an alphabetical basis. At each of two practical sessions, 
each group was further divided into two sub-groups. After both sub-groups had 
completed the practice booklet, one sub-group was instructed to commence the 
test proper at item 1, the other sub-group at item 21. After 40 minutes’ work, 
both sub-groups indicated the last item solved by circling the item number. They 
were then given a further twenty minutes’ work on the test. In this way, two 
sub-groups submitted to each treatment were formed. Since the sub-groups 


* The data pertaining to this study were collected at the University of New England, 
New South Wales. 


submitted to the same treatment showed no significant differences in performance, 
results were combined to give two treatment groups: Group 3* consisted 
of seventy-three subjects who started at item 1 ; Group 4 consisted of sixty- 
nine subjects who started at item 21. 

At other practical sessions, all subjects were given the Advanced AL and AQ 
tests (described in the previous paper) as a check on the comparability of these 
groups, both with each other and with the external group used in the earlier 
experiment. No important differences were found. 


III.—HYPOTHESES. 


The design outlined in the previous section was intended to provide 
evidence with respect to the following hypotheses : 

(i) Omission of the first twenty items of the test would lead to a significant 
reduction in the number of items not attempted in the remainder of the 
test (items 21-48) in both the forty minute and sixty minute periods 
compared with the group which was given the complete test. 

(ii) If the first hypothesis was confirmed, then the increased number of items 
attempted would lead to an increase in discriminatory power of the test 
within a university sample, both for the forty minute and sixty minute 
periods. 

(iii) The extension of the time limit from forty to sixty minutes would sig- 
nificantly improve the discriminatory power of the test. 


IV.—RESULTS AND DISCUSSION. 

Table 1 shows the average number of items passed, failed, and not attempted 
during the standard time limit of forty minutes, and the extended time 
limit of sixty minutes, for the two groups separately. In calculating these figures 
the first twenty items were discarded for Group 3, making their results com- 
parable with those for Group 4. 


TABLE 1 
PERFORMANCE OF TWO GROUPS ON ITEMS 21-48 OF PROGRESSIVE MATRICES (1947) AFTER 
FORTY MINUTES AND SIXTY MINUTES 


                     First 40 Minutes               Total Time (Sixty Minutes) 
                     No. of Items                   No. of Items 
                     Passed   Failed   Not          Passed   Failed   Not 
                                       Attempted                      Attempted 

Group 3*    X̄        11.59    .        .            .        10.77 
            σ         4.15    .        .            .         5.00 

Group 4†    X̄        13.01    .        .            .        12.58 
            σ         3.38    .        .            .         4.28 


* Began test with item 1. 
† Began test with item 21. 


* The groups are so labelled to distinguish them from Groups 1 and 2 employed in the 
previous experiment (Yates, 1961). 


The principal results may be summarized as follows: 

(i) Omission of the first twenty items produced a significant reduction in the 
number of items not attempted (9.90 vs. 6.00; t=4.92, p<.01). Increasing 
the time limit by twenty minutes also produced a significant reduction in 
number of items not attempted, both for Group 3 (9.90 vs. 3.71; t=18.34, 
p<.01) and for Group 4 (6.00 vs. 1.12; t=14.88, p<.01). 

(ii) The increase in number of items attempted in sixty minutes was accompanied 
by a significant increase in number of items passed, both for Group 3 
(13.55 vs. 11.59; t=8.37, p<.01) and for Group 4 (14.30 vs. 13.01; 
t=6.14, p<.01). 

(iii) The omission of the first twenty items produced a significant increase in 
the number of items passed after forty minutes (13.01 vs. 11.59; t=2.22, 
p<.05), but not after sixty minutes (14.30 vs. 13.55; t=1.13, p>.05). 

(iv) The correlation between forty-minute and sixty-minute score was +0.950 
for Group 3, and +0.942 for Group 4. These correlations should be viewed 
cautiously, however, since they indicate the relationship between a score 
X (items passed after forty minutes) and a score X+Y (items passed after 
sixty minutes). 
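
The caution attached to result (iv) can be made concrete with a short sketch. A score X is bound to correlate with X+Y because X is part of the total, even when X and the increment Y are unrelated. The function below is an illustrative assumption, not part of the original analysis; it uses the standard identity for the correlation of a part with a whole, and the standard deviations passed to it are hypothetical.

```python
from math import sqrt


def part_whole_r(sd_x, sd_y, r_xy):
    """Correlation between X and X+Y, given SD(X), SD(Y) and r(X, Y).

    Uses cov(X, X+Y) = var(X) + cov(X, Y) and
    var(X+Y) = var(X) + var(Y) + 2 cov(X, Y).
    """
    cov_xy = r_xy * sd_x * sd_y
    var_total = sd_x ** 2 + sd_y ** 2 + 2 * cov_xy
    if sd_x <= 0 or var_total <= 0:
        raise ValueError("X and X+Y must both have positive variance")
    return (sd_x ** 2 + cov_xy) / (sd_x * sqrt(var_total))


# Hypothetical case: the 40-minute score varies four times as much as the
# later increment, and the two are completely unrelated (r = 0).
r_unrelated = part_whole_r(sd_x=4.0, sd_y=1.0, r_xy=0.0)

# Equal spread and no relation still gives a substantial correlation.
r_equal_spread = part_whole_r(sd_x=1.0, sd_y=1.0, r_xy=0.0)
```

Under these assumed figures the part-whole correlation is already about 0.97 with no relation at all between X and Y, which is why coefficients of the size reported in (iv) say little by themselves about whether the extra twenty minutes reorders subjects.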

In order further to investigate the effects of the addition of twenty minutes 
to the time allowed, the following analysis was carried out. Score increases in 
the final period ranged from zero to eight. The subjects in the two groups were 
combined and regrouped according to their score increase. Table 2 indicates the 


TABLE 2 


RELATIONSHIP BETWEEN SCORE INCREASE AFTER FORTY MINUTES AND PATTERN OF SCORES 
DURING FIRST FORTY MINUTES (COMBINED GROUPS). 


Score Pattern (Items 21-48) 
After Forty Minutes 


distribution of pass, fail, and not attempted scores in the first forty minutes in 
relation to score increase in the last twenty minutes and reveals a very striking 
pattern. Subjects who showed no score increase clearly have attained their 
upper limit of success in the first forty minutes and do not benefit by additional 
time. Subjects whose score increase was one, two, or three points, appear to 
form a homogeneous group which is not very different from the zero-increase 
group. However, subjects whose score increase was four, five or six points, also 
form a homogeneous group, but with a very different pattern of performance in 
the first forty minutes compared with the other groups. While their pass rate is 
only slightly less than that of the other groups, their failure rate is very much 
lower and their not attempted rate very much higher. In other words, these 
subjects (forming about 13 per cent. of the total sample) appear to work slowly 
but at a high level of accuracy during the first forty minutes and, in terms of 
power, are severely handicapped by a time limit.* It is perhaps worth pointing 
out that a score increase of six points represents, in terms of the Australian 
norms for this test, an increase in centile rank (using a score increase from 27 to 
33 as an example) from 20.3 to 56.7. The intellectual level of these students will 
be severely underestimated if an artificial time limit is imposed on the test. 
It is an open question whether this distinctive group would be handicapped 
or not in either their university courses or examinations by this apparent 
tendency to work slowly but very efficiently. In the absence of further informa- 
tion, either effect would be conceivable. Thus, it could be argued that they 
would have difficulty in completing enough high-level work under constant 
pressure of time (a point of particular relevance where part-time external 
students are concerned) or of writing enough material under examination 
conditions to obtain results commensurate with their ability. It might equally 
well be argued, however, that, although their output might be smaller than that 
of faster-working students, its quality would more than compensate for this. 
Some information, of an inconclusive nature, is available on this point.† 
All the subjects involved in this study were enrolled in Psychology I and their 
examination results are shown in Table 3. Four categories of results were avail- 
able—pass with merit; pass; fail (but granted a supplementary examination) ; 


TABLE 3 


RELATIONSHIP BETWEEN SCORE INCREASE AND PASS/FAIL CATEGORY IN PSYCHOLOGY I 
EXAMINATION. 


                                   Examination Category 
Score Increase         Pass With      Pass          Provisional*   Fail 
                       Merit                        Fail 

Low Increase (0-3)     13             72            7              29 
                       (10.75%)       (59.50%)      (5.79%)        (23.97%) 

High Increase (4-8)    4              9             2              6 
                       (19.05%)       (42.86%)      (9.52%)        (28.57%) 


* Candidate granted a supplementary examination. 


and fail outright. The figures in brackets are percentages. It is clear that the 
‘high-increase’ group shows a pattern of results which is fairly similar to that 
of the ‘low-increase’ group and it may be concluded that the characteristic 
pattern of work on the intelligence test does not appear to have any marked 
implications as far as passing or failing examinations is concerned. This is not 
surprising since success or failure in examinations is undoubtedly a function of a 

* In his manual, Raven did not specify any time limit; the Australian norms 
published by the A.C.E.R. are, however, based on a time-limit of 40 minutes. 


† For information contained in this section, the author is indebted to Mr. H. C. Sheath, 
Director of External Studies of the University of New England, and to the Heads of 
Departments, for kindly supplying details of examination results. 


large number of factors, of which speed of working is only one. Two comments, 
however, may be made. Students who proceed in a cautious but accurate way 
are capable of succeeding (in some cases at a high level) in time-limited examina- 
tions. Secondly, it is not known to what extent students in the pass category 
might be capable of lifting their performance into the pass-with-merit category, 
given additional time ; nor is it known whether some of the students who failed 
the examination might have passed with more time at their disposal. These 
points would merit further investigation, perhaps by the use of objective-type 
examinations or the use of timed and untimed attainment tests. 


SOCIAL FACTORS AND SCHOOL INFLUENCE IN 
JUVENILE DELINQUENCY 


AN ANALYSIS OF POLICE CASES IN THE BELFAST JUVENILE 
COURT, JULY, 1961, TO JUNE, 1962 


By G. A. DELL 
(Educational Psychologist, Belfast Education Authority)* 


Summary. An analysis of some of the educational and social factors associated 
with 492 police cases appearing in the Belfast Juvenile Court over a period of 
twelve months, showed the heaviest incidence of delinquency among children 
of secondary age in non-selective schools. Differences in incidence rates between 
Roman Catholic and Protestant children were strongly associated with differences 
in socio-economic status. Other factors appearing to facilitate delinquency 
were a low level of socio-economic status, retardation in attainments, and 
possibly also particular age groups. Independently of the general socio- 
economic level, the location of a school in one of the old, central, and socially 
declining, areas of the city was associated with a high delinquency rate. The 
quality of a school’s morale was effective in a small but measurable degree in 
counteracting tendencies to delinquency. Ways of increasing the strength 
of this influence are briefly discussed. 


I.—INTRODUCTION. 


THE purpose of this paper is to outline some of the educational and social 
factors associated with juvenile delinquency in Belfast, by analysis of one year's 
record of police cases appearing before the Juvenile Court. The explanation of 
individual variation—why one child becomes delinquent and another does not— 
rests on psychological, constitutional, and perhaps chance factors, rather than 
on social data, but analysis of the social factors involved draws attention to the 
kind of social conditions appearing to facilitate delinquent behaviour, and 
therefore, to areas in which preventive measures may most economically be 
applied, even though these measures may, from the nature of the analysis, be 
compensatory rather than radical, in that they relate to the appearance of the 
symptom and not to its aetiology. 


II.—THE DATA. 


(a) Tabulation of 492 Police Cases. 

These tables, derived from an analysis of school reports presented in each 
case, are given in full in Appendix I. In brief, they allow the following factors 
to be considered : type of school attended (level of education provided); type 
of school attended (county or voluntary administered); age distribution ; 
level of attainment relative to type of school attended; attendance record ; 
and differences in incidence rates between individual schools (this last table is 
not included in the Appendix). 


(b) The Distribution of the General School Population of Belfast. 


(i) The Annual Report of the City of Belfast Education Committee for 
1960-1, gives enrolment figures as at 31st December, 1960. These include 
children attending extra-district schools, and were the latest published figures 

* The views expressed are those of the author and do not necessarily represent those of 
the Belfast Authority. 


available. They have been used as the basis for comparisons between groups of 
schools, age groups, etc. (See Appendix II.) 

(ii) For comparisons between particular schools, the Annual Statistical 
Returns for 1961 have been used. These give figures for enrolments on 12th 
January, 1962, classified according to age at 31st December, 1961. We are, of 
course, interested only in the distribution for children above 8 years of age, 
since these are technically the children ‘at risk.’ 


(c) A Measure of Socio-Economic Level of Each School. 


A crude index of the socio-economic background of each school has been 
obtained by using the school meals service data on the number of necessitous 
children receiving free meals in each school during a week chosen at random. 
This has been termed a ‘poverty rate.’ Rates were calculated for both 1961 
and 1962, in both cases a sample week in May being chosen. The rates for the 
two years correlated highly (Rs=0.93 for county secondary intermediate 
schools) showing the index to be a reliable one. In the main, the poverty rate 
for 1962 was used subsequently. 


(d) A Measure of School Quality. 


In order to try to assess the influence of school quality on juvenile delinquency, 
as far as possible independently of socio-economic background, five 
senior officials of the Education and Health Departments were asked to rank 
thirteen boys' and mixed secondary intermediate schools (the equivalent of 
English secondary modern schools) in order of ‘quality.’ The five rankings were 
then subjected to a test of minimal significant concordance (Kendall's Statistic 
W) and were found to be very significantly concordant (W=0.80, P<0.001). 
It was then possible to produce a final order of quality by summing the individual 
rankings for each school. 
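
The concordance check and the summed-rank ordering described above can be sketched as follows. This is a minimal illustration assuming complete rankings without ties; the function name `kendalls_w` and the example rankings (five judges, four schools rather than thirteen) are hypothetical and are not the officials' actual rankings.

```python
def kendalls_w(rankings):
    """Kendall's coefficient of concordance W for untied rankings.

    rankings: a list of m lists, one per judge, each giving the ranks
    1..n assigned to the same n schools in the same order.
    Returns (W, rank_sums).
    """
    m = len(rankings)
    n = len(rankings[0])
    for ranks in rankings:
        if sorted(ranks) != list(range(1, n + 1)):
            raise ValueError("each judge must use every rank 1..n exactly once")
    # Sum of the ranks given to each school by all judges.
    rank_sums = [sum(ranks[i] for ranks in rankings) for i in range(n)]
    mean_sum = m * (n + 1) / 2
    # S: squared deviations of the rank sums from their mean.
    s = sum((total - mean_sum) ** 2 for total in rank_sums)
    w = 12 * s / (m ** 2 * (n ** 3 - n))
    return w, rank_sums


# Hypothetical rankings of four schools by five judges (illustrative only).
judges = [
    [1, 2, 3, 4],
    [1, 3, 2, 4],
    [2, 1, 3, 4],
    [1, 2, 4, 3],
    [1, 2, 3, 4],
]
w, rank_sums = kendalls_w(judges)

# Final order of quality: schools sorted by summed rank (lowest sum = best).
final_order = sorted(range(len(rank_sums)), key=lambda i: rank_sums[i])
```

W runs from 0 (no agreement among judges) to 1 (identical rankings), so a value of 0.80 indicates that the summed ranks are a reasonable basis for a single order.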


III.—RESULTS. 
(1) Type of School Attended—Level of Education. 

Very highly significant differences (χ²=241.81, P<0.001) were obtained 
between the incidences of delinquency reported for the various types of school. 
The heaviest incidence lay among unreorganized primary school pupils of 
secondary age (1.70 per cent. of those at risk), followed by secondary intermediate 
school pupils (1.34 per cent.). The incidence among primary school 
pupils of primary age was much lower (0.32 per cent.), and that for secondary 
grammar schools the lowest of all (0.20 per cent.). Incidence rates for special 
schools were very high (1.86 per cent.) but unreliable owing to the small 
numbers involved. In general, both age and ability appear to be involved, as 
well as other factors. The overall incidence rate for school 
children at risk was 0.76 per cent. 


(2) Type of School Attended—Religious and Administrative Affiliation. 

There were no significant differences between county and voluntary schools 
overall (χ²=1.21, P<0.30), or between county and voluntary primary school 
children of primary age (χ²=0.005, P<0.95), or between county and voluntary 
secondary intermediate schools (χ²=0.655, P<0.50), or between county and 
voluntary E.S.N. special schools (χ²=0.002, P<0.98). There were highly 
significant differences between county and voluntary primary school children of 
secondary age—i.e., attending unreorganized primary schools—(χ²=10.89, 
P<0.01) favouring county school children, and between Protestant and Roman 
Catholic grammar school children (χ²=6.48, P<0.02) favouring Protestant 
children. Even where differences were not significant, incidence rates in 
voluntary schools tended to be higher than those for comparable county schools 
(the sole exception to this tendency being in the case of E.S.N. special schools). 
However, when allowance is made for the socio-economic difference, and when 
this factor is held reasonably constant, differences in delinquency rates associated 
with the religious or administrative affiliation of schools tend to disappear. 
For example, when county grammar schools are compared with Roman Catholic 
grammar schools (i.e., omitting Protestant voluntary grammar schools, which 
may be assumed to draw pupils from on the whole higher socio-economic backgrounds) 
χ²=0.15, P<0.70. From another point of view, an inspection of the 
poverty rates for the unreorganized primary schools, where the biggest difference 
between voluntary and county schools occurs, shows that the voluntary schools 
have a considerably larger number of necessitous children on rolls than have the 
county schools, size held constant. It is probable that whereas reorganization 
of county schools follows a plan which is relatively unaffected by the socio- 
economic background of the children concerned, in the case of voluntary schools 
the poorer parishes are understandably slower to reorganize. There is, perhaps, 
in consequence a selective factor operating to increase the delinquency rates of 
voluntary unreorganized primary schools relative to comparable county schools. 


At the same time, since the poverty rates for voluntary schools in Belfast 
tend to be higher than those of comparable county schools, at all levels of 
educational provision, there is an implication that perhaps a high poverty rate may 
be less significant for delinquency at the primary age, and within certain ranges of 
ability. 


(3) The Age Factor. 


The evidence clearly shows that delinquency rates increase with age, up to the 
end of the period of compulsory education (i.e., at 15 years of age, in the case 
of Northern Ireland children). Thereafter, the rate falls again (presumably 
because the over-age children still at school are selected by ability, and by social 
class, and are less subject to delinquent influences). 

There is some evidence of an abnormally large increase in the delinquency 
rate at 12+, and to a lesser extent another at 13+. The standard increase in 
incidence rate from one age group to the next appears from these data to be 
approximately 0.16. This is consistent up to the 12+ age group, when the 
increase jumps to 0.44, and for the 13+ group it increases by another 0.32. 
Finally, in spite of the generally and consistently increasing delinquency rate 
from one age group to another, the differences between the rates for 13+ and 
14+ were not in fact statistically significant (χ²=0.27, P<0.70). This finding 
modifies to some extent the notion that the peak delinquency rate must necessarily 
be in the last year of compulsory schooling. 


(4) Attainments. 

There is a significantly heavier incidence of retardation among delinquents 
than one might expect on a chance basis (χ²=1€4.66, P<0.001, on the assumption 
that the ratio of above average : average : below average children in the school 
population is 20 : 60 : 20. Any other symmetrical distribution would increase 
χ² still further). For children still at school, there is no significant difference 
between primary and secondary intermediate schools in regard to the proportion 
of delinquent children who are retarded (χ²=2.87, P<0.30). However, if one 
includes school leavers in the assessment, the tendency towards a higher 
incidence of retardation among secondary school children than among primary 
school children, becomes statistically significant (χ²=6.74, P<0.05). This 
suggests that the retarded school leaver is in even greater hazard than the 
retarded secondary school child, in terms of delinquency. There appears to be 
no association between delinquency and academic standing in the grammar 
school, although numbers are very small. Rank order correlations between the 
delinquency rates for all mixed and boys’ secondary intermediate schools and 
the rate showing the proportion of retarded children among the delinquents 
from each school, are consistently negative, and of the order of -0.30, but do 
not reach an acceptable level of statistical significance. However, the consistent 
appearance of negative correlations here does suggest that high delinquency 
rate schools tend to have a generally lower standard of attainment, to the extent 
that the delinquent groups from these schools are less likely to be assessed as 
retarded. 
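
The goodness-of-fit comparison described above (observed attainment counts against an assumed 20 : 60 : 20 distribution) can be sketched as follows. This is only an illustration under stated assumptions: the counts are hypothetical and are not the study's data, and the function name is introduced here for convenience.

```python
def chi_square_goodness_of_fit(observed, ratio):
    """Return (chi-square statistic, expected counts) for observed counts
    tested against a hypothesised ratio such as 20 : 60 : 20."""
    total = sum(observed)
    ratio_total = sum(ratio)
    expected = [total * r / ratio_total for r in ratio]
    statistic = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    return statistic, expected


# Hypothetical counts (NOT the study's data):
# above average, average, below average.
hypothetical_counts = [20, 240, 140]
statistic, expected = chi_square_goodness_of_fit(hypothetical_counts, [20, 60, 20])

# Three categories give 2 degrees of freedom; the tabled critical value
# of chi-square at P = 0.001 for 2 d.f. is 13.816.
CRITICAL_P001_DF2 = 13.816
print(f"chi-square = {statistic:.2f}, expected = {expected}")
print("significant at P<0.001:", statistic > CRITICAL_P001_DF2)
```

An excess of below-average cases over the assumed 20 per cent. is what drives the statistic upward in such a comparison.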


(5) Attendance. 


The tendency appears to be for delinquents as a group to show somewhat 
poorer attendance records than one might expect from non-delinquent children, 
but there is no adequate basis of comparison here, and many delinquents seem 
to have been excellent attenders, and the great majority satisfactory in their 
attendance. There is, in fact, no evidence for supposing that regularity of 
attendance in itself has any special bearing on delinquency. 


(6) Socio-Economic Status of School Background. 


Correlations between this factor and delinquency rates are fairly consistently
positive, high, and statistically significant, when groups are analysed that are
reasonably homogeneous by sex and religious affiliation. For example, for
county secondary intermediate schools (excluding girls’ schools) Rs is 0.88,
P<0.01. For all voluntary secondary intermediate schools Rs is 0.79, P<0.05.
For both county and voluntary secondary intermediate schools (excluding
girls’ schools) taken together Rs drops to 0.65, P<0.05. For eighteen county
primary schools having delinquents Rs is 0.46, P<0.05.


A strong association between high poverty rates and high delinquency rates 
is clearly established, especially at the secondary intermediate school level. 
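
As an illustration of how a rank correlation such as Rs is obtained, the following minimal sketch computes Spearman's coefficient as the product-moment correlation of the ranks, with tied values given their average rank. The school figures are hypothetical and serve only to show the arithmetic; the function names are assumptions of this sketch, not part of the original analysis.

```python
def average_ranks(values):
    """Rank values from 1 upward, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rs(x, y):
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(x)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical figures for six schools (NOT the study's data).
poverty_rate = [5, 12, 20, 8, 30, 15]
delinquency_rate = [1.0, 2.5, 2.0, 1.5, 4.0, 3.0]
rs = spearman_rs(poverty_rate, delinquency_rate)
print(f"Rs = {rs:.2f}")
```

With no ties this agrees with the familiar formula 1 - 6Σd²/(n(n² - 1)).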


(7) Size of School.


Rather surprisingly, significant associations have been found between
small schools and high delinquency rates. One might have expected to find that
the more personal atmosphere of a small school would be associated with lower
delinquency rates, but, in fact, the correlation between size and delinquency
rate is a negative one at both the primary and the secondary level. For the
eighteen county primary schools having delinquents, Rs is -0.76, P<0.01.
For county secondary intermediate schools (ten boys’ and mixed schools), Rs
is -0.68, P<0.05. At the primary school level one might attribute
part of this correlation to the fact that the calculation of delinquency rates
from small numbers of delinquents tends to favour the larger primary schools.
However, the same objection cannot be made to the results from secondary
schools, where no school has fewer than 500 children at risk, and yet the
correlations are roughly of the same magnitude and in the same direction.

Correlations between ‘size of school’ and ‘poverty rates’ are also of
interest. For the eighteen county primary schools Rs is +0.09, N.S. For
the ten county secondary intermediate schools Rs is -0.83, P<0.01. The small
primary schools are thus not ‘impoverished,’ whereas the smaller secondary
intermediate schools do tend to be.

The rationale behind this particular group of findings may be as follows:

(a) The larger secondary schools tend to be the newer schools, and schools in
newer housing areas, where the general socio-economic level may be
expected to be of a higher order than in the older areas of the city served by
smaller schools.

(b) While there may be no significant association between size of primary
school and socio-economic level, inspection of the list of schools concerned
shows that a number of the smaller primary schools are high up in the
delinquency list and also in the poverty ranking. It is also apparent that
these schools tend to be congregated in certain areas of the city, notably in
Court Ward, St. Anne’s, and other central areas. Their common
characteristic may be not merely a low level of socio-economic status, which may be
true of many other primary schools in Belfast, but also being centred in
what E. Jones (1960) has described as areas of declining importance from the
social and residential point of view, where space is at a premium, and where
the general trend of migration is outwards for the more ambitious and
inwards for the socially deprived.

In the case of the county intermediate schools, while socio-economic status
clearly must contribute to the size of the correlation between size of school and
delinquency, when socio-economic status is partialled out, using Kendall’s Tau
and partial rank correlation technique (Siegel 1956), there is still a substantial
negative correlation between size and delinquency rate (Tau=-0.43, N=10)
which may well be statistically significant.
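
The partialling-out step can be sketched as below, using Kendall's partial rank correlation formula, tau(xy.z) = (tau_xy - tau_xz·tau_yz) / sqrt((1 - tau_xz²)(1 - tau_yz²)). The three input coefficients in the example are hypothetical, since the individual Tau values for the Belfast schools are not quoted in the text; the function names are likewise introduced only for this sketch.

```python
from math import sqrt


def kendall_tau_a(x, y):
    """Kendall's tau (version a): concordant minus discordant pairs,
    divided by the total number of pairs. Tied pairs count as zero."""
    n = len(x)
    score = 0
    for i in range(n):
        for j in range(i + 1, n):
            product = (x[i] - x[j]) * (y[i] - y[j])
            if product > 0:
                score += 1
            elif product < 0:
                score -= 1
    return score / (n * (n - 1) / 2)


def partial_tau(tau_xy, tau_xz, tau_yz):
    """Kendall's partial rank correlation of x and y with z held constant."""
    return (tau_xy - tau_xz * tau_yz) / sqrt((1 - tau_xz ** 2) * (1 - tau_yz ** 2))


# Hypothetical coefficients (NOT the study's values):
# x = size of school, y = delinquency rate, z = poverty rate.
example = partial_tau(tau_xy=-0.6, tau_xz=-0.5, tau_yz=0.5)
print(f"partial tau = {example:.2f}")
```

If the partial coefficient remains well away from zero, the association between x and y cannot be attributed wholly to their shared relation with z.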


(8) School Quality. 

A. B. Clegg (1962d) has reported an investigation with thirty West Riding
of Yorkshire secondary schools, in which he found a correlation of 0.50 between
school quality (as measured by rankings of a panel of four senior officers of the
Education Department) and Delinquency rate, with high quality schools
tending to show lower delinquency rates. He further found “no correlation
at all” between School Quality and the Social Desirability of the areas in which
the schools were located (as measured by the rankings of senior county council
officials), and a correlation of 0.02 between Delinquency and Social Desirability.
He concluded that “the evidence from these schools supports the commonsense
view that, even in adverse circumstances, the school itself can do much to
combat delinquency and juvenile crime.” For thirteen Belfast boys’ and mixed
secondary intermediate schools this analysis shows a correlation between low
quality and high delinquency rate well in line with Clegg’s finding (Rs=0.56,
P<0.05). But contrary to Clegg’s finding, the correlation between
socio-economic status, or ‘poverty rate,’ and delinquency rate is even higher
(Rs=0.65, P<0.05). The correlation between school quality and poverty
rate is 0.63, P<0.05.

In fact, if socio-economic status, or ‘poverty rate,’ is partialled out (now
using Kendall’s tau throughout), the correlation between school quality and
delinquency rate drops from Tau=0.45, P<0.05, to Tau=0.29 (no tests of
significance available, Siegel 1956).

Thus, Clegg’s conclusion is not so optimistically borne out by this analysis.
There is certainly a measurable effect due apparently to school quality, or to 
some related factor, but this effect seems to be considerably less than that of 
socio-economic influences. Indeed, if this were not so, any school with a high 
delinquency rate would be saddled with some burden of guilt for failing to 
combat the seeds of crime. 


IV.—LIMITATIONS OF THE DATA. 


(a) Possible errors in the data: duplication due to recidivism has been 
eliminated (reducing the number of cases from 508 to 492), and cases who had 
already left school at the time of their court appearances have also been identi- 
fied (leaving 418 cases of children still at school). However, other minor classi- 
ficatory errors may still exist—e.g., a child before the court may be the subject 
of a report from a school which he, in fact, no longer attends, or did not attend 
when he committed the delinquent act. 


(b) The data must, of course, be regarded as applying to ‘police cases
before the Juvenile Courts’ rather than to Juvenile Delinquency in Belfast in
general. Intelligent delinquents, delinquents from well-to-do homes, and
younger delinquents may well be under-represented.


(c) No distinction has been made between male and female delinquents, 
except incidentally when this is revealed through the schools attended. The 
universal finding is that male delinquency greatly exceeds female delinquency, 
particularly during school ages. For example, in 1954, of 337 children appearing 
before the Belfast Juvenile Court in police cases, only sixteen were girls. It is 
not unreasonable, therefore, to exclude girls' schools from this analysis, which 
is what in fact has been done. However, by the same argument we should,
strictly speaking, consider only the boys at risk in, for example, mixed secondary
schools; otherwise a comparison including both boys’ and mixed secondary
schools will weight the delinquency rates of the boys’ schools unfairly. In fact,
although only correlations based on delinquency rates calculated from total
school enrolments have been quoted in Section III, similar data were also
obtained for delinquency rates calculated for boys’ enrolments only, and these
reproduced similar patterns of results in terms of the size and direction of the
correlation coefficients.


(d) No distinction is made between indictable and non-indictable offences, 
and all police cases are assumed to be of equal ' social significance,' although the 
typical delinquent act may, in fact, vary from age group to age group. However, 
at all the ages under consideration, the great majority of offences relate to some 
form of Larceny and/or Breaking and Entering, so that this objection is not 
particularly powerful in this context. 


(e) For some of the data there is no adequate standard of comparison— 
for example, ‘level of attainment’ is a purely relative and probably rather
subjective assessment. Similarly, for ‘level of school attendance’ there is no 
adequate control data. 


(f) The index of socio-economic background used here is a very partial
estimate, stressing only one aspect of status, and may possibly introduce an
element of age selection: e.g., families with older children may be less likely
to fall within the ‘necessitous’ group. Also, a status index of this kind can be
misleading in that it is not a continuous measure of status from low to high,
but an index only of the amount of low status in any given school.


(g) The measures of ‘School Quality’ and ‘Socio-Economic Status’ are
apparently highly reliable, but they may not necessarily be valid measures. It
is possible, for example, that the ‘School Quality’ index may be in part just a
‘better’ measure of ‘Socio-Economic Status’ than is the Poverty Rate used
for that purpose.


V.—DISCUSSION AND CONCLUSIONS. 


The general pattern of these results is one showing the heaviest incidence
of delinquency among children of secondary age in non-selective schools.
Within the non-selective secondary schools a number of factors appear to
facilitate or to be associated with delinquency: notably, a low level of
socio-economic status, retardation in attainments, an age factor, and the
possibility that particular age groups (e.g., 12+) may be specially susceptible.
Independently of the general socio-economic level, the location of a school in
one of the old, socially declining areas of the city seems to be associated with an
increased delinquency rate, and the quality of a school’s morale or influence
seems to be effective only to a rather limited degree in overcoming and
counteracting more powerful tendencies to delinquency.

Typically, then, it is not the near-failures at 11+ who tend to become
delinquent in the main (in the secondary intermediate schools with good 
academic standing and opportunities, the proportion of retarded children in the 
delinquent group is higher than in those schools of generally lower academic 
status). It is rather those who are academically retarded, or who lack academic 
goals, who are most susceptible. The selection process itself does not seem to be 
a significant factor, although the educational segregation that follows it may 
tend to minimize the compensatory influence that ‘school quality’ might 
otherwise exert. 

In general, these findings confirm those reported for similar areas in the 
United Kingdom. This is in itself of interest, since the consistently high level 
of unemployment in Belfast, and the existence of deep-rooted religious and 
social differences which have in some respects produced parallel cultures within 
the same geographical community, might have been calculated to exert a 
differentiating influence on delinquency rates here. Barritt and Carter (1962) 
have shown, however, that socio-economic differences may account for a great 
deal of what ostensibly appears to be difference due to religious affiliation, 
because socio-economic status in Northern Ireland is correlated with religious 
affiliation. The evidence here suggests that the same is true of apparent differ- 
ences in delinquency rates as between Roman Catholic and Protestant schools, 
where these are statistically significant. 

The influences of socio-economic status and educational retardation have
been attested by, among others, Ferguson (1952) on Glasgow data. Burt
(1944), in his analysis of London children after the First World War, concluded
that while poverty in itself made a comparatively small contribution to
delinquency, conditions associated with poverty, or facilitated by it, were of the
very highest significance (e.g., defective discipline especially in large families,
intellectual disabilities, detrimental interests, neighbourhood influences, etc.).
But while the influence of this ‘poverty complex’ appears to be strong in the
Belfast data, and in spite of the high unemployment rate in Belfast over many
years, nevertheless, delinquency rates in Belfast compare not unfavourably
with those of most other large cities in the United Kingdom. The ‘poverty
complex’ operates within a cultural setting in which other factors can
apparently modify its influence considerably.

The particular modifying influence on which this analysis provides some
evidence is that of the school. It is commonly assumed that the school can be
a positive force against delinquency. The Scottish Advisory Council on
Education (1952) concluded that the evidence “gives some encouragement to
teachers who believe that the school can exert a favourable influence on pupils
living in an area where delinquency is common.” Mays (1962) argues that
reducing the incidence of juvenile delinquency is basically an educational
problem. Clegg (1962 a, b, c, d), in a series of brief reports and memoranda,
has written most stimulatingly on this possibility, although he describes the
attitude of some members of the teaching profession to this question as one of
reserve, commenting that “it is almost as if they were prepared to forgo the
credit for inhibiting delinquency if they can escape censure for not doing so”
(Clegg, 1962 d). However, this analysis of Belfast data seems to indicate that
the influence of ‘school quality’ is much less than that of, say, ‘poverty.’ How
can one reconcile the optimistic and the less optimistic results? Under what sort
of conditions may ‘school quality’ become a powerful influence in combating
delinquency?

On the first point there are, of course, many possible explanations of this
discrepancy. Possible errors of measurement and analysis aside, there are
differences in the overall level of delinquency rates, and probably in general
socio-economic level as measured by a ‘poverty rate’; the West Riding data
refer only to indictable offences, and sample a wide area using broad
assessments of ‘social desirability,’ while the Belfast analysis covers a small area,
using a narrow but more finely graded measure of ‘poverty.’ Any or all of these
factors could conceivably tend to affect the apparent influence of school
quality as against socio-economic background. Similarly, the inclusion or
exclusion of girls’ secondary schools could have a very large effect on the size
of the correlation between school quality and delinquency, and between
socio-economic background and both these factors. Finally, it is possible that
the West Riding estimates of ‘school quality’ are less contaminated by
judgments based partly on socio-economic impressions, or indeed that the West
Riding schools may have been more successful in influencing delinquency rates,
both for better and for worse. But the similarity between the two findings in
regard to the correlation between Delinquency Rates and School Quality (West
Riding 0.50, Belfast 0.56) makes this seem improbable, and the total absence
of correlation between School Quality and Socio-economic background in the
West Riding data itself requires explanation. It seems not unreasonable,
therefore, to say that at least under the general prevailing social conditions in Belfast,
the influence that the school can exert on delinquency rates should not be
exaggerated.

On the second point, we enter the realms of speculation, and as Clegg 
(1962 b) points out we urgently need studies of ' good ' and ‘ bad’ schools from 
the standpoint of delinquency. But from what facts we have it seems that if the 
quality of a school's morale can influence delinquency rates as the evidence shows 
that to some degree it already does, then the real problem is how to manifest 
that quality in the lower streams equally with the top. In recent years, the 
stress on quality and parity has been related to providing opportunities for the 
11+ failures. If we ranked our schools for the quality of their bottom streams
we might find a much higher correlation with delinquency rates. In larger
secondary schools of good academic standing, the organization of departments
for the retarded streams, with some autonomy within a flexible framework, seems
promising. In smaller or academically weaker secondary schools a more
experimental approach is urgently required. In some cases this might mean
experiment with an unstreamed organization (Rowe 1959). Clegg (1962 a, b)
has described very clearly problems and methods in the development of school
discipline. Equally important is the need to develop avenues of legitimate
achievement, for the solitary as well as for the gregarious. Priority for these
schools and areas in the development of Youth Centres and Further Education
Centres, in allocations of time, space, and money for games and other shared
facilities, in the institution of counselling relationships between teachers and
pupils that can be sustained beyond the period of school attendance, in the
arrangement of small group school visits rather than the more typical large
group visits: these may be some of the ways in which relief can be given, and
response obtained.

Perhaps the first priority might well be for an education authority to adopt 
its school with the consistently highest delinquency rate, as an experimental 
school, and to try to see how organization, curriculum, activities, relationships, 
and the supporting facilities of the surrounding community, could be developed 
by a joint effort to make an inroad against delinquency.


VIII.—APPENDICES.

APPENDIX 1.—Tabulation of 492 Police Cases.

TABLE 1A

TYPE OF SCHOOL ATTENDED.


County SCHOOLS VOLUNTARY SCHOOLS 


At School | Left School | Total | At School | Left School | Total


PRIMARY—Primary age .. | 44 29 —
PRIMARY—Secondary age .. | 20 57 18
SECONDARY (Intermediate)† .. | 160 74 18





4 2 
10 — 


238 34 





*—Includes 4 boys attending a voluntary non-R.C. grammar school.
†—‘Secondary Intermediate’ is equivalent to ‘secondary modern.’


TABLE 1B


AGE RANGE OF DELINQUENTS


Age Groups


16+ | Total





Attending School. . 
Left School .. 





Above Average 
Average 
Below Average 





TABLE 1C 


ATTAINMENTS OF DELINQUENTS. 


PRIMARY SECONDARY GRAMMAR 
At Left 


School | School 
8 12 

92 124 
50 98 


284 


Left 
School 


At 
School 


Left 
School 


2 2 
15 14 
28 3 


19 


At 
School 


45 





TABLE 1D 


ATTENDANCE OF DELINQUENTS 


At School Left School 





418 
74 


30 
30 











77.2%


Satisfactory / Excellent 


Irregular/Bad . 


22.8%





TOTAL NUMBERS INVOLVED 







APPENDIX 2.—Distribution of School Population of Belfast in Relation to 
Delinquents. 
TABLE 2A 


TYPE OF SCHOOL ATTENDED


                          | County Schools  | Voluntary Schools | Total Numbers
                          |   D    |   P    |   D    |   P      |  D  |   P
PRIMARY—Primary age       | 60.3%  | 60.7%  | 39.7%  | 39.3%    |  73 |
PRIMARY—Secondary age     | 26.0%  | 49.8%  | 74.0%  | 50.2%    |     |
SECONDARY (Intermediate)  | 68.4%  | 71.0%  | 31.6%  | 29.0%    | 234 |
GRAMMAR                   | 42.2%* | 73.1%* | 57.8%  | 26.9%    |     |
SPECIAL (E.S.N. only)     | 66.7%  | 66.2%  | 33.3%  | 33.8%    |  15 |
Add (Other ‘Special’)     |        |        |        |          |     | 278
Totals                    |        |        |        |          | 418 | 55270

* Includes voluntary non-R.C. schools.
D = Delinquents. P = Population.





TABLE 2B 
AGE RANGE 


Age Groups 154- 


% of school ula- = 


——— 
tion aged 164- . . . . . 5 ‘ . 12.4 55270 
95 of delinquents at . E . g . . . 6:7 | 72 418 





COMMENTS ON 


SOCIAL FACTORS AND SCHOOL INFLUENCE IN JUVENILE
DELINQUENCY BY G. A. DELL.


By A. B. CLEGG 
(The Education Officer, West Riding of Yorkshire County Council) 


Mr. Dell’s investigation is of considerable interest and confirms the results of 
Burt and others in approximately similar urban environments. The main difference 
between his results and the results of what was a very simple investigation by a local 
authority in the West Riding lies in the relationship between school quality and 
socio-economic status. I am satisfied from the West Riding enquiry that a school in 
which the morale is high because the children are finding school life worth while can, 
and indeed often does, make a contribution to the standards of behaviour in the area.
Whether this could happen in a dense urban area where standards were really low, 
is another matter. I personally believe it could and that in some urban areas 
evidence could be found of its having done so. It would, of course, be an interesting 
and valuable piece of pioneer work if, as Mr. Dell suggests, a large Authority could 
staff a school in a really bad area with a task force of skilled and sensitive teachers 
and then follow up the results to see whether they had made an impact on the wider 
social environment. 




SUMMARIES OF RESEARCHES REPORTED IN DEGREE 
THESES 


A Projective Technique for the Assessment of Occupational Interests, 
with Special Reference to Creative and Artistic Capacities and Decision 
Making. 


By LISBETH BRIESS, M.Sc., Ph.D.


(Abstract of a Thesis submitted for the M.Sc. degree in Psychology of the University of
London.)

AIM.

To devise a projective technique, capable of being scored objectively, for the
assessment of occupational interests.


THE TECHNIQUE. 

The Subjects (S) are presented one by one with 75 ambiguous drawings on cards,
S being permitted only one of two given interpretations; S has to decide whether,
for instance, a drawing represents a human being or an animal, a human being or a
slab of stone, a dog or a car, in a way similar to that used by Frenkel-Brunswik and
Vernon Hamilton, but so designed that perseveration is excluded. There are
25 cards with the alternatives ‘human or animate’ (‘animate’ means animal or
plant), 25 cards with the alternatives ‘animate or inanimate,’ and 25 cards with the
alternatives ‘human or inanimate.’ If S is unable to make a choice he has to state
that he cannot decide and that it could be either. Another 25 cards with
unstructured drawings are mixed amongst these, and S is asked whether the drawings are
meaningful or not meaningful for him.

The 75 cards with alternative choice have the purpose of assessing (a) a Subject's 
basic interests; (b) his distastes, the importance of which has been stressed by 
Rodger*; and (c) his general ability to make decisions. The 25 unstructured 
drawings give information about his capacity for doing creative and/or artistic 
work. 

Every drawing had to satisfy a criterion before it was incorporated in the set 
for the actual experiment. The picture had at least once in eight successive responses 
to be interpreted in both ways and this had to happen for three consecutive sequences 
of eight interpretations. It was ascertained that there were no significant differences 
in the interpretations between men and women, and between Europeans and 
non-Europeans who had lived in our society for some time.

After an unscored trial with three other pictures, the thoroughly shuffled 
cards are presented one by one, and S is instructed to give the very first impression 
as quickly as possible, but there is no time limit set. Questions are then asked about 
occupation, interests and leisure activities. The scores consist of the number of 
choices S has made for every one of the five categories, called briefly ‘Human (H),’
‘Animate (A),’ ‘Inanimate (I),’ ‘Either (E),’ and ‘Not meaningful (NM).’

After a preliminary pilot experiment two experiments were carried out in a very
similar way, but with different sets of cards in order to make sure that the results
were not attributable to features of the cards.


EXPERIMENT I. 

This experiment was carried out on fifty Subjects. They consisted of members of 
the academic, administrative and technical staffs and students of several departments 
of Birkbeck College and the London School of Economics, and social workers. Their 
ages ranged from 17 years to the mid-50s, and there were about as many men as women.

Reaction-times were taken between the Experimenter reading the description
of a card and S commencing to say his choice. This experiment gave highly significant
results in support of the hypotheses that a Subject's general ability to make decisions
and his capacity for doing creative and/or artistic work can be assessed in this way,
if his answers to the questions about special interests and spare-time activities may
be taken as a crude criterion prior to a systematic validation. The results of this
experiment with regard to interests are, as likely as not, due only to chance, as the
sample was too heterogeneous. No attempt was made with this heterogeneous
sample to investigate distastes.

The mean E-scores for those S’s who knew what kind of occupation they wanted 
were significantly lower than for those who had not yet made up their minds. 
As older people, who are already in occupations, are trained at Birkbeck College, 
the sample was very useful for this part of the investigation. The Mann-Whitney 
test* for the individual E-scores gave a significant result at the 1 per cent. level. 
The reaction-times before scoring an E were significantly higher than before scoring 
any other category. By analogy with the claims for word-association tests, the 
assumption cannot be excluded that the inability to make a decision has such 
emotional repercussions, even in a simple experiment, that there is a correlation 
between underlying emotions and reaction-times.* 
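
The Mann-Whitney comparison of individual E-scores between the two groups can be sketched as follows. The scores are hypothetical (the abstract does not give the individual values) and the function name is an assumption of this sketch. It shows only how the U statistic is counted; significance would then be read from the published tables for the two sample sizes.

```python
def mann_whitney_u(sample_a, sample_b):
    """Return (U_a, U_b), where U_a counts the pairs in which a value from
    sample_a exceeds a value from sample_b (ties count one half)."""
    u_a = 0.0
    for a in sample_a:
        for b in sample_b:
            if a > b:
                u_a += 1.0
            elif a == b:
                u_a += 0.5
    u_b = len(sample_a) * len(sample_b) - u_a
    return u_a, u_b


# Hypothetical E-scores (NOT the study's data).
decided = [1, 2, 2, 3, 4]      # Subjects who knew what occupation they wanted
undecided = [3, 5, 6, 6, 8]    # Subjects who had not yet made up their minds
u_decided, u_undecided = mann_whitney_u(decided, undecided)
print("U =", min(u_decided, u_undecided))
```

A small U means that the two sets of scores hardly overlap, which is the pattern reported above for decided and undecided Subjects.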

The S's with no artistic/creative interest or spare-time activities had significantly
higher NM means than the others.


EXPERIMENT II.

This experiment was carried out in a manner similar to Experiment I, but with 
specially selected Subjects and a new set of drawings. Two samples were obtained, 
one mainly of thirty-three students of science and engineering, and junior laboratory 
technicians who intended to become university students; the second mainly of 
forty-seven casework students and students who had the special desire to become 
teachers. The S's were all Europeans, 17-25 years old, with at least qualifications 
for entering a university. They were not aware that they had been selected for the 
sample as social workers, scientists, etc. These samples yielded sufficiently
encouraging results with regard to interests to make the investigation of other occupations
worth while, although the validation has to be repeated because the samples were 
small. Reaction-times were not taken with this sample to make sure that there 
was no distraction from the process of externalisation. 

With this sample, distastes were investigated in the following way. If S scores
much lower in one or two of the categories H, A, or I than in the others, this item
may concern S so little, or be so strongly repressed, that any dwelling on it would
produce a distaste. It has been postulated that a distaste is indicated if Chi-square
between the lowest and the highest scores is significant at the 5 per cent. level. 
Distastes according to this postulate for one or the other category of interests are 
indicated with high frequency. 
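
One possible reading of the distaste criterion is sketched below. The abstract does not say how Chi-square was computed, so the sketch assumes a one-degree-of-freedom test of the highest against the lowest category score with equal expected frequencies, which reduces to (high - low)² / (high + low). The scores and the function name are hypothetical, and only the single lowest category is examined, whereas the abstract allows for one or two.

```python
CRITICAL_P05_DF1 = 3.841  # tabled chi-square at P = 0.05 for 1 d.f.


def indicated_distaste(scores):
    """Return (category, chi-square) for the lowest-scoring category if it
    differs significantly from the highest-scoring one, otherwise None.
    ASSUMPTION: equal expected frequencies for the two categories."""
    highest = max(scores, key=scores.get)
    lowest = min(scores, key=scores.get)
    high, low = scores[highest], scores[lowest]
    if high + low == 0:
        return None
    chi_square = (high - low) ** 2 / (high + low)
    if chi_square > CRITICAL_P05_DF1:
        return lowest, chi_square
    return None


# Hypothetical scores for one Subject (NOT the study's data).
hypothetical_scores = {"H": 30, "A": 25, "I": 10}
print(indicated_distaste(hypothetical_scores))
```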


FACTOR ANALYSIS.

With the data of Experiment II Stephenson's inverted technique of factor 
analysis, by which Subjects are correlated, was performed, and multi-factor 
analysis, as described by Cattell, yielded four general factors which coincided with 
personality dimensions. 

The results of one factor analysis have, of course, to be interpreted with caution.
But since the M.Sc. examination, the factor analysis has been repeated on two more
samples, and the same patterns have emerged for the loadings of the four factors;
thus, the procedure has yielded the same factors three times. One of them is a
bipolar factor for what Super* has described as “... scientific interest, technical or
material interest, humanistic or social welfare interest ...”, which contrasts
scientific interests on one side with humanistic or social welfare interests on the
other.


A Survey of the Attainments and Interests of Ten-year-old Children from
Differing Geographical Environments within a County. 


By A. N. SMITH
(Abstract of a thesis for the degree of M.Ed., in the University of Manchester, 1962.) 


AIMS. 

The survey aimed to discover whether the attainments of 10-year-old children 
in English and Arithmetic and their interests, differed according to their environ- 
ments: rural, urban and ‘ fringe.’ 

Subsidiary questions were : 

(a) Is there a uniform pattern of interests amongst the 10-year-old children in
different localities ? 

(b) Do children's academic and social advantages vary according to geographical 
environments ? 


THE CHILDREN STUDIED. 

The survey was limited to twelve maintained schools, chosen largely because 
their head teachers agreed to co-operate in the investigation, but also distributed 
over different geographical environments within a county. The schools were then 
classified, with the assistance of their head teachers, as ‘ urban,’ ‘ fringe ' or ‘ rural.’ 
All the 10-year-old children in the ‘ fringe’ and ‘ rural’ schools were studied, but 
in the ‘urban’ schools a sample was taken from an alphabetical list. This was a 
50 per cent. sample selected by taking every other name on the list. 


METHOD. 

Three main areas of environmental influence upon the children were examined— 
the school, the home and the neighbourhood. Information about all three areas was
collected by guided-interview techniques using a questionnaire guide, and also 
school log books, census reports, school records and personal observation. All 
interviews were conducted by one person to ensure a set pattern of procedure. 
Attainment scores were extracted from primary school record cards and a modified 
form of an interest schedule was also used (Blyth, W. A. L., 1954). 


ANALYSIS OF DATA 

(1) The School Environment. The schools of the urban, fringe and rural 
localities were compared in a number of respects; the number, training, qualifica- 
tions, experience and interests of the teaching staff; the physical conditions and 
history of the schools ; the school children ; the organisation and the curriculum of 
the schools; the contact between children and teachers and between the school 
and the home. 

(2) The Home Environment. The occupational groupings of the parents, the 
education of the parents, the size of the family, family interests and contacts and 
the use of the mass media in the family group, were all compared between one 
locality and another 

3) The Neighbourhood Environment. Comparisons were made noting the 
facilities in the homes of the locality, the size and nature of the gardens, the numbers 
of playmates, the play areas to be found and the availability of transport. 

(4) Attainments. The scores of Moray House English, Arithmetic and Verbal
Reasoning tests for 9-year-old children were extracted from the School Record 
Cards and comparisons between the three types of environment were made. The 
scores of Southend and Vernon Reading and Arithmetic tests were also extracted 
from School Record Cards and compared. 


RESULTS. 

The fringe area children appeared to have more advantages which stemmed from 
the home. The correlation between localities and the occupational groupings of 
the fathers was significant at the 1 per cent. level, but some of the advantages 
appeared to originate from locality rather than socio-economic sources. Thus, the
lack of suitable play areas for urban children was a result of the nature of the 
urban locality, as were the limited contacts with other children in the rural areas. 

In an attempt to derive an overall impression of the environment, the urban, 
fringe and rural areas were assessed as being above-average, average or
below-average in respect of the many aspects examined and ‘plus’ or ‘minus’ scores
were calculated. These scores summarise the advantages or disadvantages of the 
three areas. 





[Table of ‘plus’ and ‘minus’ scores for the three areas not reproduced.]

The coefficient of contingency for the above figures is significant at the 1 per cent.
level.
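
For readers unfamiliar with the statistic, the coefficient of contingency is usually computed from chi-square as C = sqrt(chi-square / (chi-square + N)), where N is the total count. The sketch below assumes that this (Pearson's) form is the one meant. The counts are hypothetical and are not the survey's figures, which are not available here; the function name is likewise an assumption.

```python
import math


def contingency_coefficient(table):
    """Return (C, chi2) for a table of counts given as a list of rows.

    C is Pearson's coefficient of contingency, sqrt(chi2 / (chi2 + N)).
    Assumes every row and column total is non-zero.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (chi2 + n)), chi2


# Hypothetical counts (NOT the survey's data): rows are urban, fringe and
# rural; columns are the numbers of 'plus' and 'minus' assessments.
hypothetical_counts = [[8, 20], [22, 6], [12, 16]]
c, chi2 = contingency_coefficient(hypothetical_counts)
```

The significance of C is judged by referring the chi-square value to its distribution with (rows - 1) x (columns - 1) degrees of freedom, which is presumably how the 1 per cent. level quoted above was reached.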


CONCLUSIONS.

(1) The children from different geographical environments are influenced in 
different ways. (The differences may be diminishing because of improved com- 
munications and the influence of the mass media.) 

(2) The urban children appeared to have the greatest number of unfavourable 
environmental conditions (more limited background, fewer amenities in the home, 
less adequate play facilities) although they have some environmental advantages 
(easy contact with other people, entertainment facilities readily accessible). 

(3) The children of the ‘fringe’ areas have considerably more advantageous
environmental conditions, the limitations of the area being easily outweighed by 
its advantages. 

(4) Children in the ‘fringe’ areas showed higher attainments in English and
Arithmetic than those in either the urban or the rural area, between whom there
were no significant differences in this respect. These advantages and those referred 
to in conclusion (3) above, may have been associated as much with the socio- 
economic status of the parents as with the nature of the locality. 

(5) The interest schedule suggested that there were some locality differences 
in the interests of the children, although it was again probable that the mass media and 
improved communications have minimised the differences. There was considerable 
overlap of interests between the areas.

(6) Thus, within its sampling limits, this survey indicates the existence of
relatively few locality differences which are not explicable in socio-economic terms.


An Investigation into the Efficiency of Schools Television Lessons in the
Teaching of Elementary Science. 


By ALBERT ROYDS


(Summary of thesis presented for the degree of Ph.D. in the University of Manchester,
1963.) 


AIM.

The hypotheses set up for investigation were : 

(1) That, after viewing a schools television lesson in elementary science, pupils’ 
recall of its subject matter is influenced by : 

(i) factors in the structure of the lesson and its presentation, such as: lesson 
content; lesson method; and programme structure ; 

(ii) factors in the pupils’ own natures and environments, such as: their sex;
intelligence; geographical distribution; type of school; and their
socio-economic grade.

(2) That there is interaction between the factors of (i) and (ii) mentioned above, 
i.e., between the lesson and the pupils’ natural and environmental
characteristics.


METHOD. 

1,578 pupils, 872 in primary, 649 in secondary modern, and 57 in grammar
schools, wrote answers to questions based on the content (subject matter and
presentation details) of an elementary science lesson, which they viewed under
classroom conditions, as part of a complete series of television lessons. 


RESULTS.

The principal results are summarised as follows:

(1) The factor which has the greatest influence on recall of subject matter is the
grade of pupils’ intelligence. Their intelligence grade is also shown to influence
their intention to attempt post-programme activities suggested for them.
(2) Girls evince greater interest than boys.
(3) Pupils’ age has an effect on: their interest, their recall of vocabulary, of scientific
applications, and of attention-gaining devices from the lesson programmes.
(4) Urban pupils show greater interest than rural. Urban pupils also show better
recall of basic facts.
(5) Factual learning of science applications is greater for secondary than for primary
pupils.


(6) Pupils in the upper social grade show greater interest and greater intention to 
undertake suggested activities.
(7) Pupils’ interest in the lesson programme topics shows a relationship with 
teachers’ estimate of their teaching importance. 
(8) In the presentation of the lessons, matters which influence pupils’ recall 
include : 
(i) topics which involve reasoning, either by the narrator or by the pupil, are
more effective than those stated dogmatically;
(ii) repetition helps pupils’ interest and their intention to undertake activities;
(iii) length of time devoted to a topic, its position early in a programme, and
its narration at conversational pace, add to the impression made;
(iv) appearance of reality, association with living (including human) creatures, 
and mobility, contribute to effective recall. 
(9) Pupils learn vocabulary from the lesson programmes. 


CONCLUSIONS.
The results of the investigation support the experimental hypotheses.


BOOK REVIEWS 


BIGGS, J. B. (1962). Anxiety, Motivation and Primary School Mathematics.
London: National Foundation for Educational Research. Occasional 
Publication, No. 7, pp. 388, 30s. 


This published thesis of an extensive inquiry into children’s achievement in 
arithmetic, in relation to anxieties and to teaching method, is an interesting
contribution to the growing literature on children’s learning and achievement in
mathematics. It is a publication for research workers, for its detailed and specialised
treatment makes it unsuitable for readers lacking knowledge of psychology
and psychometrics, but the problems it raises, perhaps more than the conclusions 
arrived at, are of universal interest to teachers of mathematics. 

Dr. Biggs briefly, but comprehensively, considers research into the nature of 
mathematical ability, and observes that whereas in theories of learning one basic
process is postulated, in the theories of Wertheimer, Piaget, and Dienes different
kinds of thinking are distinguished. In particular he contrasts learning and problem
solving which result from appreciation of structure with that which takes place
through rote learning or some procedure which does not involve insight into
structure; and he questions whether they can be considered as different aspects of one
process. 

In discussing the role of anxiety in learning, he finds both contradictions and 
inadequacies in existing theories. He is critical of the equation of drive with
anxiety, but disagrees with the finding that anxiety is always interfering. Biggs 
postulates that those easily aroused to anxiety will perform better until the difficulty 
of the task causes anxiety to mount to an inhibiting degree, and that inhibition is 
less likely to set in if the method of learning is insightful. He sees anxiety in relation 
to arithmetic as resulting from innate tendency to anxiety, to methods which employ 
anxiety as a stimulus and to the difficulty of the task. 

For this major investigation, involving nearly five-thousand children from 
eighty-seven schools, teaching methods were classified as traditional, structural or
motivational. (In traditional methods responsibility is not delegated to the children ; 
they are taught as a class, use a text-book and are rarely allowed to co-operate. In 
contrast, structural materials are intended to be used by individuals so that mathe- 
matical relations may be discovered at the child’s own pace. Motivational methods 
use centres of interest, often miniature life situations to encourage insight into the use 
of numbers and measures and children often co-operate in their work). The tests used 
were to assess intelligence, four aspects of anxiety, and three kinds of arithmetical 
achievement—mechanical arithmetic, problems, and understanding of concepts. In 
addition, ratings of social status, parental interest and ability were made for each 
child by teachers. Use of the tests raises problems: girls gain higher scores than 
boys in general anxiety, but this may be because they are more willing to admit 
anxiety or because boys feel a need to falsify responses ; concept tests cease to test 
concepts when the relationships are easy or well known, the response being changed
from an insightful one to an automatic one; in tests of mechanical and problem
arithmetic traditionally taught children are already test-sophisticated whereas
motivationally taught children are rarely, if ever, tested. An unexpected difficulty 
to the reader is that the concept tests specially prepared for this investigation are 
not included in the text.

Statistically, the results are dealt with by a factor analysis and analyses of 
variance. The chief factors which appear are of arithmetical progress, anxiety 
attitudes centred around school, and anxiety related to mechanical arithmetic
and formality of teaching. It is somewhat tedious to follow no less than a dozen
exploratory three-way analyses of variance, but they provide numerous significant 
interactions which deserve further investigation. Among the more interesting 
indications are that methods of teaching which emphasise insight induce less anxiety; 
that performance, especially in mechanical arithmetic, worsens steadily as number 
anxiety increases (or vice versa); and that girls tend to favour rote learning and
trial and error methods whereas boys show more appreciation of structure. An 
additional smaller investigation suggests that where girls have been taught for 
several years by methods giving abstractive experience, they use insightful methods 
more readily in working problems. 

This is an investigation which should stimulate other workers in the field in
attempts to understand how children learn to think mathematically, what motivates
them and in what ways insightful thinking can be encouraged. R. M. BEARD.


COOLEY, WILLIAM W. and LOHNES, PAUL R. (1962). Mathematical Procedures
for the Behavioral Sciences. New York: John Wiley and Sons, Inc.,
pp. X+211. 


This book presents computer programmes in the Fortran code for multiple 
and canonical correlations, two and three way analysis of variance and covariance, 
multiple discriminant analysis, principal components, factor scores and varimax
orthogonal rotated factors. There are still far too few programmes in this country
for the main statistical procedures used by psychologists, and this volume will,
no doubt, interest the expert programmer. The snag is that, although coding on
different machines is largely based on common principles, these are less important
for practical purposes than the idiosyncrasies of individual machines. Few people
capable of understanding Fortran will need it to programme their Pegasuses,
Mercurys and Atlases. On the other hand, the flow charts for programming any 
digital computer should prove very useful. F. W. WARBURTON. 


FOSHAY, A. W., THORNDIKE, R. L., HOTYAT, F., PIDGEON, D. A., and WALKER,
D. A. (1962). Educational Achievements of Thirteen-year-olds in Twelve
Countries. Hamburg: Unesco Institute for Education, pp. 68, n.p.


This book is based on the sound philosophy that comparative education could 
well utilise some methods employed in current psychological research. Whether the 
corresponding practical evaluation of that philosophy is the most suitable for its 
task is a matter of considerable conjecture, for any measurement evaluation as 
between countries, must take careful account of differing underlying principles and 
educational assumptions so that any statistical treatment demonstrates rather than 
suppresses the differences involved. Tests of attainment are particularly susceptible
to national differences in curricula, philosophies, teaching methods, and a variety of
individual attitudes. Moreover, the susceptibility is to the whole profile of such
influences, and no one influence is necessarily more important than another. 

The present study breaks new ground in organising and reporting on a survey 
of attainments in four school subjects (Mathematics, Reading Comprehension, 
Science, Geography), and on the variations in ability ranges as shown by a non- 
verbal intelligence test taken by groups of thirteen-year-olds in twelve countries. 
The methods of sample selection and the relative sizes varied somewhat from one 
country to another, and the authors acknowledge this as part of the exploratory 
nature of their work. The main comparisons between countries are by means and 
standard deviations treated in a variety of ways, and these statistics suggest that
Reading Comprehension shows the least difference and Mathematics the
greatest. A tentative study by Hotyat of profiles of test results for two groups of 
three countries each, is extremely interesting, and worthy of much cogitation for 
future development. Likewise, Walker's study of teacher reactions is vital to an 
investigation of this nature. Other evidence includes such aspects as sex differences, 
relationships of scores to parental occupation levels, and information on the variation 
of item-difficulties from one country to another. 

Certain problems seem to obscure the conclusions. First, any attainment test
(particularly in Mathematics or Science) has a close relation to the syllabus covered
and this latter is known to differ considerably between England and Europe, as also
between other countries. Differences in general teaching approaches must also be
taken into account. Straight translation or the simple adjustment of test items
does not express these basic differences. Thus, a ‘syllabus-difference’ factor must
be allowed for in the interpretation of results. Second, the concept of an ‘average
mean’ or ‘average standard deviation’ between countries seems to entail a
supposition that no relative differences exist. In this study these terms seem to have 
little genuine meaning, and an alternative criterion measure is needed. Such a 
measure must be capable of rational interpretation. Averages, in this sense, merely 
obscure the differences they are intended to evaluate. Hotyat’s use of profiles is a 
much more promising technique. 

Finally, though it may be possible that certain philosophies of school
organisation, such as streaming, may in theory tend to suggest the occurrence of larger
standard deviations, it would, with children of secondary school age, be extremely 
difficult to demonstrate the precedence of this single factor over those described 
above, or even over straightforward differences in sampling. It is regrettable that 
the analysis of such excellent raw data took so little account of these and similar 
problems to the extent that at least one of the conclusions could be thoroughly 
misleading. D. M. LEE.


GRAY, SUSAN W. (1963). The Psychologist in the Schools. Holt, Rinehart and
Winston, Inc., pp. ix+406, $5.50. 


Although some parts of this book are likely to be of interest only to the American 
reader, the main questions dealt with, those relating to the widening functions of the 
psychologist in the school system, are currently of great concern to L.E.A.
psychologists in this country, who have become increasingly aware of their role in 
educational guidance, the prevention of maladjustment and in “the transfer of 
psychological knowledge, skills and understanding ” to others. Our own services can 
benefit from a knowledge of the ways in which the school psychologist works in the 
U.S.A., where he has closer links with classroom teachers than is usually the case 
here. 

Unfortunately, the style of the book often irritates and it is over-long, repetitive 
and full of platitudinous statements. The important sections (which are mainly in 
the first nine chapters) are obscured by laboured discussions of matters of marginal 
relevance to the main theme of the book. Nevertheless, there is much practical 
wisdom, if little that is new, in the author’s general treatment of her topic. She 
exhorts psychologists to try to distinguish between “the urgent and the important”
in their professional work and, without casting all school psychologists in the same 
mould, would like to see them making major contributions to educational progress 
by initiating and guiding research enquiries in the schools. 

The book should encourage educational psychologists, particularly those in 
training, to give more thought to their professional roles and relationships. There is a
useful list of references at the end of each chapter. MAURICE CHAZAN. 


JAHODA, MARIE (1963). The Education of Technologists: An Exploratory Case
Study at Brunel College. London: Tavistock Publications, pp. xiii+ 226, 
30s. 


This is an account of the Nuffield Research Project in Technological Education 
carried out by Dr. Jahoda and her associates at Brunel College of Advanced Tech- 
nology in the period 1959-1962. The problem consisted in examining the educational 
process as this was experienced by students working for the Diploma in Technology. 
The distinctive feature of these courses is that students spend a considerable
proportion of the four years involved working in industry. In Brunel College the
‘thin sandwich’ system operates; that is, students alternate six months in college
with six months in industry. As may well be imagined, this creates problems of 
considerable variety ; organisational, educational, personal. Dr. Jahoda discusses 
most of the problems which confront students, staff and the companies participating 
in the educational process, in an extremely illuminating way. 

The method used in the research project was to interview students on entry, at
the end of their first six months in college, and at the end of their first industrial 
period. The college staff was also interviewed, and a sample of firms. Interview 
schedules were extremely detailed and sought to throw light on the socio-economic 
background of students, their previous experience, their professional expectations, 
detailed impressions of their first industrial period, the integration or otherwise of
the college and industrial experiences. A large part of the present report is made up
of the analysis of this questionnaire material but some attempt is made throughout 
to relate it to a wider context of educational principles. Reference is also made to 
“ socio-technical system analysis ” as developed at the Tavistock Institute of Human 
Relations, but this appears as something extraneous to the study. 

None of the conclusions from this investigation come as a novel revelation to 
anyone directly involved in educating technologists—this was not to be expected. 
It is the function of research in this new field of work to confirm or deny one’s 
subjective impressions and Dr. Jahoda has approached this task carefully and 
systematically. Her findings should be of value in relation to the problems of
organising higher education not only in colleges of technology, but also in
universities. Within the limits of the method of questionnaire studies, this is a distinctive
piece of work, well-designed, carefully analysed and written up in a pleasing style, 
with a sense of relevance and of the broader issues. JOHN McLEISH.


Knapp, B. (1963). Skill in Sport. Routledge and Kegan Paul, pp. 204, 30s. 


This most readable and worthwhile book has been written by an author with
experience both in playing games at the international level and in teaching the 
theory and practice of physical education to university students. Its aim is to 
provide teachers and coaches with a basic knowledge of the main factors involved 
in the acquisition of skill in recreational physical activity. The approach to the 
subject is psychological; the contents cover in detail the factors affecting the
learning of skills, individual differences in acquiring skills, theories of learning and 
physiological mechanisms. Not only is the coverage broad, but it is fully illustrated 
by examples taken from a wide and varied number of games and recreational 
activities. Unfortunately, the material is not organised in an apparently logical
sequence and the reviewers feel that many readers will prefer the alternative chapter 
arrangement suggested by the author. 

As this book is primarily intended for the teacher and coach, a selection of
examples should have been included to show how learning theories are applied to the 
organization of teaching material and to the practicalities of teaching. Also, 
technical terms such as sensory feed-back and the interpretation of sensory cues 
should have been expanded and illustrated by references to teaching method. 

This book does not contain original experimental work by the writer, but its 
value lies in the commentary and discussion of a prolific number of articles on the 
acquisition of skill in sport. A number of articles and theories are of doubtful value,
but Miss Knapp does not accept the sources uncritically ; for example, the claims 
made for the transfer effects of training are treated with caution, and the validity 
of certain tests of physical skill is questioned. The book’s main appeal will be to the
student teacher specialising in physical education, not only as a basic textbook on
skill learning, but as a useful source book on experimental data. 

For a long time there has been a need for a comprehensive work on the learning 
of physical skills and the author is to be congratulated on her initiative in producing 
a book which should stimulate both discussion and controlled research in this 
relatively unexplored field of study. GSR. Cross and J. C. Evans. 


LYSAUGHT, JEROME P., and WILLIAMS, CLARENCE M. (1963). A Guide to 
Programmed Instruction. New York: Wiley, pp. iii+180, 30s.


There have been so many books written about programmed instruction that it
is refreshing to come across one which gets down to tackling the fundamental
problem of how to set about and sustain what has been called the ‘ inspired drudgery’ 
of writing programmes. The success of programmed instruction as a pedagogical
technique will depend on the attitudes and skill of those who apply it. Both a 
favourable attitude and the ability to use the technique fruitfully are more likely to
develop amongst teachers if they understand programmed instruction and have tried 
writing programmes for themselves. 

Although rigorously behavioural in approach, this book defines technical terms
such as ‘ reinforcement’ in plain language with abundant examples. In this way, 
understanding programmed learning is shown to be but a small step from under- 
standing S-R theories on the Thorndikian model. Using the resulting uncluttered 
reinforcement theory only when absolutely necessary, the authors describe the 
procedures for generating programmes which they found most effective with 
neophyte programmers drawn from education and industry. 

Three chapters deal with the related preparatory tasks of selecting a suitable 
fragment of the syllabus, gathering data on the target population, and writing down 
in behavioural terms the immediate and long-term objectives of the programme. 
By this stage, the inherent structure of the subject-matter may well be apparent 
and may help in the selection of an appropriate paradigm from the list which
includes linear, branching, conversational chaining, mathematics, and various 
modifications of these. Each of the paradigms is explained and illustrated: but 
thereafter the book is concerned solely with linear constructed-response programmes 
as being more suited to teachers learning the art. 

In the chapter on evaluation, potential users of published programmes will find 
a host of questions likely to embarrass most programmers. But the point is a valid 
one. A programme is of little use unless the educator knows what it can do. This 
means that he must be told all about the initial testing of the programme and the 
assumptions made about the learners' knowledge and abilities. 

Yet the intention of this book is not to curb programmers by supplying their 
critics with ammunition. The authors prefer to welcome newcomers to program- 
ming, especially practising teachers, by giving them guidance and encouragement to 
enable them to set up their own miniature automatic teaching systems which will be
self-correcting with a strong bias towards higher efficiency. To all who think that the 
operation of such a system is likely to be a rewarding activity, this book is highly 
recommended. BERNARD DODD.


MICHOTTE, A. (Eng. Trans. by T. R. Miles and Elaine Miles) (1963). The
Perception of Causality. London: Methuen and Co., xxii+425 pp.,
45s. net. 


It is good that we should have this excellent English translation of the great 
work of Professor Michotte of Louvain. It is, of course, already well known to 
psychologists who have had many opportunities of hearing Professor Michotte at 
Congresses as well as reading its French form La Perception de la Causalité which was 
published in 1954. At the same time, however, this work is little known outside 
strictly psychological circles and its revolutionary finding that there may be direct 
perception of causation seems to have made no impact on general thinking. One may
still hear a philosopher saying that no one has ever suggested that a cause may be 
directly perceived ; from now on there will be no good reason for any well-informed 
person not knowing that Professor Michotte has suggested this and has directed a 
massive research supporting the suggestion and demonstrating the conditions under 
which this perceptual character is experienced. 

Michotte rightly emphasises the compelling character of his demonstrations to
those who have actually seen them. Under certain restricted spatio-temporal 
conditions, a moving patch of colour approaching a stationary patch which itself
takes on the motion of the first patch after contact is reported to be seen to ‘push’
the second object; ‘launching’ it if the first patch stops on contact, ‘entraining’
it if the first patch continues to move with the same velocity as the second after
impact. The narrowness of the conditions under which this causal experience occurs
is no less remarkable than its vividness when it does occur. Lengthening of the 
interval between impact and resultant movement changes the experience to one in
which the second movement is ‘triggered’ by the first, but is not a continuation of it 
and finally to an experience in which two distinct movements are seen. Michotte 
also finds this primary causal experience confined to situations in which there is 
apparent transfer of movement ; he does not find it possible, for example, in such a 
case as a light going on when a door is opened. 

The natural line of explanation of a causal perceptual experience is to attribute 
it to the after-effect of experiences in which objects have been seen to be set in motion 
by other objects. Michotte argues strongly against this view and supports his
argument by the results of many ingenious experiments. He agrees, however, in 
his latest discussion that, in addition to the primary causal experience, there are 
situations in which we attribute causation as a result of physical experiences,
although there are also many physical causal situations in which there is neither the 
primary nor the secondary type of causal experience. For the nature of the experi- 
ence of perceiving such secondary and acquired causal dependence, Michotte 
suggests the term ‘perceived dependence’ as distinct from ‘perceived causality’
which is the name given to the phenomenal character of such experiences as the 
launching or entraining of his basic experiments. 

The translation by T. R. and Elaine Miles is clear and adequate. A final
commentary by T. R. Miles provides a summary which is a useful guide to the intricacies
of Michotte’s researches on perceived causality. There are two final essays by T. R. 
Miles, the first defending Michotte’s phenomenology against the suspicion that it 
offends against neo-behaviouristic orthodoxy by implying the prohibited concept
of ‘dualism,’ the second discussing the question of how far Michotte’s findings
contradict what was said about causation by Hume. R. H. THOULESS. 


NEWSON, JOHN and ELIZABETH (1963). Infant Care in an Urban Community.
London: George Allen and Unwin, Ltd., pp. 268, 42s. 


This fascinating book records the results of asking 709 Nottingham mothers of 
year-old children questions about the methods they use in rearing and training their 
children. The mothers were chosen to form a representative sample and the 
questions were asked in standard form by health visitors in a half-hour (or longer) 
interview. The answers are carefully analysed statistically for differences between 
occupational groups, and the text is considerably enlivened by verbatim replies 
recorded during some of the interviews. 

In an area which has been considered for decades as of fundamental importance 
for a sound theory of personality development it is surprising that psychologists 
have been satisfied for so long with the lack of solid information about how, in fact, 
children are being brought up in contrast to the plethora of injunctions as to how 
they should be. As the Newsons suggest, there is need for a psychological ecology in 
which problems are studied less in laboratories and clinics and to a far greater extent 
in the natural habitat of the human animal. This is certainly true of early childhood. 
It is unfortunate that we are so dependent on questionnaire material about infant 
rearing practices, especially as certain strong biases exist towards giving answers 
to particular questions in the direction which the subject thinks will be acceptable to 
the interlocutor. The authors bring this out very clearly in relation, for example, 
to such things as the use of the ‘dummy’ as a pacifier, toilet-training, breast- 
feeding, etc. A large number of pit-falls yawn before the unwary investigator in a 
study of this kind. The Newsons, being parents themselves, as well as psychologists, 
are aware of the propensity of parents to lie about their children, and have noted the 
level of confidence with which their material can be accepted and the nature of the 
qualifications to be made before generalising from the statistical comparisons. 

The class differences which emerge are surprising not so far as their nature, but
certainly as far as their amount is concerned. For example, genital play is checked 
by 93 per cent. of mothers in the lowest occupational class, whereas 75 per cent. of 
the highest group ignore it altogether. 80 per cent. of one-year-olds in the lowest 
group, whose mothers have attempted to train them on the potty, have never been 
successful at twelve months; in the highest occupational group only 36 per cent. 
belong in this category. 60 per cent. of the lowest working group mothers and 
fathers practically never go out together during the baby's first year: the corres- 
ponding number in the highest group is 25 per cent. Working class mothers breast- 
feed their children two or three times as rarely as middle-class mothers; they use 
the dummy as pacifier about twice as often; they smack their one-year-olds rather 
more often; the fathers participate much less in the process of rearing the child. 

The great virtue of this book is that it sticks rather closely to the facts and 
avoids large generalisations. The authors obviously know the group on which they 
are reporting intimately. This means that they can often throw a strong light on the 
interpretation of some remark by placing it in a social and cultural context which 
belongs to the respondent and not to the recipient of the remark. In short, this is a 
valuable and sophisticated study; it is strongly to be hoped that preparations are 
well advanced for further volumes by the same investigators on the later stages of 
child-handling. It would be very acceptable if, supplementing possible questionnaire 
material, behavioural records could also be made available for the same groups of 
children and parents. JOHN MCLEISH. 


PIAGET, J., INHELDER, B., and SZEMINSKA, A. (Trans. by Lunzer, E. A.) (1960). 
The Child's Conception of Geometry. London: Routledge and Kegan Paul, 
pp. vii+408, 45s. 


This volume must be read in conjunction with Piaget's earlier study of the child's 
conception of space. The earlier work provides much of the basis from which the 
authors seek to study the growth of understanding of the more precise concepts 
involved in measurement and metrical geometry, such as a change in position, 
conservation and measurement of length, locating points in two or three-dimensional 
space, and basic notions concerning areas and volumes. The method is similar to 
that used in previous enquiries, entailing systematic question and answer techniques 
with individual children of a variety of ages, but continuing up to and beyond the 
equivalent of the English primary stage. The content of the enquiry is related to the 
kind of geometry taught in England as against a more intuitive exploration of 
spatial concepts in the earlier book. It is clearly developmental work giving strong 
indications of the origins from which the formal study of geometry should arise. 

The general lay-out of the results conforms to those found in other Piaget 
studies as regards a general development through from ego-centric thought and 
action to the final stage of formal operations. Probably most interesting is the sharp 
indication from an early stage that the child gains his impressions from the structur- 
ing of the three-dimensional situation as a whole, and not from consideration and 
summation of the separate parts—a statement having far-reaching implications for 
the early teaching of geometry. Alongside this it is also valuable to note the growth 
of understanding in conservation of measurement, this being only finally completed 
at the level of formal operations, where conservation of volume finally becomes 
c A 
While the rigidity of some of the age-ranges given for each stage of thinking 
might well be queried, the general implications of the whole sequence of results 
have strong pointers in relation to teaching. Too often the excellent general treat- 
ment of spatial concepts in the early years of home and school is followed by a long 
silence at the junior school level and an abrupt intrusion into formal geometry at the 
secondary stage. Clearly, the main implications of this present work point to a need 
for continuous fostering and treatment of geometrical ideas so that formal notions 
are developed as coherent concepts rather than isolated rote laws. In this context, 
a study of this book is particularly appropriate and valuable for teachers of junior 
school children. D. M. LEE. 


PICKFORD, R. W. (1963). Pickford Projective Pictures. London: Tavistock 
Publications, pp. xi+122, 30s. Picture material 25s. The set 50s. 


Those who have tried the various available forms of picture projective tests 
with young children will have noticed the gap that exists between the T.A.T. or 
Symonds’ Picture Story Test for older children, and the rather childish and off- 
putting C.A.T. Professor Pickford’s new set of pictures may help to fill this gap. 
The unwillingness of young children to tell lengthy stories, makes it desirable to 
have a large number of pictures yielding many short stories in which recurrent 
themes may be discerned. 

The Pickford Projective Pictures consist of 120 simple line-drawings which 
“cover a very wide range of stimulus content,” varying from relationships with 
father figures and authority, and relationships with mother figures including sibling 
jealousy, to play and conflict between children, and death and ghosts. There are 
some twenty distinctive themes represented. Professor Pickford suggests that the 
pictures should be used about six at a time, so that the material will last for about 
twenty therapeutic sessions. However, for diagnostic use, a larger number of pictures 
each yielding a short story might well reveal highly significant recurrent themes 
reflecting some of the child's pre-occupations and worries. 

The book accompanying the test material contains some very useful data for 
responses given to the pictures by fairly large groups of children in Glasgow child 
guidance clinics. Data of this sort are too rarely available for projective tests. 

The book also contains a chapter on the use of the pictures in analytic treat- 
ment which must not be allowed to antagonise people who are allergic to Freudian 
interpretations of innocent comments by children. For example, the comment, 
“This little boy stole apples from a rough farmer's orchard” is interpreted as a 
wish to rob the father of his sexuality. This interpretation goes on, for good measure, 
to interpret the tummy-ache the boy is said to have after eating the apples as a 
phantasy of incestuous pregnancy, on the part of the girl telling the story! 

With or without interpretations of this kind, the pictures seem to have been 
used successfully in clinics in Glasgow, and should help to fill a gap in our techniques 
for persuading children to talk in a revealing way about themselves. 

R. HETHERINGTON. 


SIMON, BRIAN and JOAN (Edits.) (1963). Educational Psychology in the 
U.S.S.R. London: Routledge and Kegan Paul, pp. xi+283, 40s. net. 
Translated by Joan Simon. 


This book will help the West to understand something of the psychological 
outlook that informs the whole system of education in the Soviet Union, and give a 
general picture of the vast scope of Soviet educational psychology. It consists of 
fourteen papers written by leading Russian workers in this field. Mr. Simon, who 
made the final selection of papers for translation, says in the Preface that the volume 
was planned with teachers and educationists particularly in mind. 

Part 1 deals with various aspects of the following problems: learning and 
mental development, education and personality development, mental development 
and intellectual backwardness, and the role of speech in the regulation of behaviour. 
In Part 2 there is an interesting account of the changes in outlook regarding the 
psychology of learning over the period 1900-1960, followed by a more detailed 
discussion of such topics as the promotion of abstraction and generalization, problem- 
solving, etc. In the final part of the book there are papers dealing with aspects of the 
following topics, and others: the psychology of reading, problem-solving in arith- 
metic, mastery of scientific concepts, mastery and application of concepts in physics, 
transfer to technical skills. 

The reviewer had three main reactions to this book, which he can only mention 
very briefly. First, the outlook of Soviet and Western educational psychologists is 
much the same over some main issues. For example, Vygotski's hypothesis that the 
process of development follows on the process of learning which creates the potential 
development, is not far from the views of Hebb and perhaps not unrelated to those 
of Festinger. Again, “a low level of development of verbal-logical thinking results, 
of course, in little capacity not only for mathematics, but also for a number of other 
activities” (Krutetski, p. 231) could have been written by British psychometrists. 
Second, the Russians have paid attention to certain issues that the West has tended 
to neglect. Elkonin (p. 166), quoting Galperin, indicates that the mastering of an 
action on the plane of speaking aloud aids the growth of mental operations; as does 
Leontiev (p. 80). Moreover, Soviet psychologists pay considerable attention to the 
moving back, as it were, from abstract thinking to practical application (p. 128 and 
pp. 198 ff.); this is a matter of great consequence in polytechnical education. Third, 
some of the hypotheses are vague, or are proposed on the basis of small numbers of 
children; these need verification. For example, exactly how effective are “Pavlovian 
methods of counterposition” in aiding differentiation in the second signal system so 
aiding abstraction and generalization (p. 124)? 

This is an important book. It is rich in ideas and cannot be neglected. We are 
indebted to all those who made the volume available to the English speaking world. 


SOLOMON, H. (Edit.) (1961). Studies in Item Analysis and Prediction. (Stanford 
Math. Studies in the Social Sciences, VI); U.S.A.: Stanford University 
Press. London: Oxford University Press, pp. ix+310, 70s. 


This is definitely a book for those interested in mathematical theory. It brings 
together a number of recent contributions to the fields of Item Analysis and Predic- 
tion, and develops a coherent picture of the statistical background involved. But 
the reader must, in general, apply the theory for himself. Apart from a rather brief 
Editor’s introduction, there is little practical help or guidance as to the types of 
problems which might be solved by the use of the various theoretical aspects 
discussed. 

Whilst such volumes as this could play a considerable role in improving and 
developing techniques for educational research, they will not achieve their goal 
until the writers generate a better synthesis of theory and practice. Most practising 
research workers have neither the background knowledge nor the time to unravel 
the complexities of advanced statistics before applying it to the problem in hand. 
Theories of as complex a nature as those propounded in this book need to be illustra- 
ted by a variety of very suitable and careful applications. D. M. LEE. 


TRAPP, E. P., and HIMELSTEIN, P. (Edits.) (1963). Readings on the Exceptional 
Child. New York: Appleton-Century-Crofts. London: Methuen, 
pp. 674, 55s. 


JENKINS, J. J., and PATERSON, D. G. (Edits.) (1963). Studies in Individual 
Differences. New York: Appleton-Century-Crofts. London: Methuen, 
pp. 774, 63s. 


DE CECCO, J. P. (Edit.) (1963). Human Learning in the School. New York: 
Holt, Rinehart and Winston, pp. 636, $4.75. 


Books of readings hold a dubious position among teachers of psychology. Some 
regard them as undesirable in that they discourage the enquiring student from 
searching for his material and learning more in his search than that for which he 
sought. Others regard them as inappropriate unless they are exactly tailored to their 
own course. Yet others, pointing to the difficulties which arise when large classes of 
students have their attentions drawn to particular articles, find books of well- 
chosen readings a great help. Judging by the number of books of readings being 
produced in America at the moment, there would seem little doubt as to the value 
placed upon them there. 

All these books are good of their kind; they cover different areas of study and 
illustrate different approaches to their compilation. That by Trapp and Himelstein 
consists almost entirely of recent, including a few unpublished, papers illustrating 
such aspects of the study of exceptional children as mental defect, giftedness, 
sensory handicaps, brain injury and, in a short, rather oddly assorted section, 
emotional disturbance. This last reflects, no doubt, the paucity of research papers 
in this field. Three ways of selecting papers were used. Reading lists from 100 
American universities were compared to extract a central core of common reading. 
This proved unhelpful since there was virtually no overlap between them, and many 
lists were ‘ grossly outdated and meagre in the experimental area.’ A second 
approach, which was somewhat more successful, was to analyse the current text- 
books on exceptional children. Most papers were selected on advice of research 
workers who were approached. A high proportion of the papers will be unknown 
to most lecturers in this country. 

Jenkins and Paterson have taken a fairly narrow theme, that of the historical 
development of the search for intelligence and the understanding of its nature and 
development. Papers are arranged chronologically and the first, as might have been 
predicted, is from Galton’s Hereditary Genius; then follow most of the great names in 
the history of mental testing: Spearman, Thorndike, Binet, Terman, Thurstone, 
Kelly, and so on to Vernon, Anastasi, Wechsler, Husen, Cattell and Guilford, 
covering nearly 100 years. Most of these papers are well known and frequently 
quoted; together they tell a coherent story of the evolution of the present position 
with its persistent doubts and its pointers for future research. 

The third of these readings, edited by De Cecco, aims at showing the relevance 
of psychological research and theorising for classroom teaching and learning. 
Virtually the whole field of educational psychology is covered in ten sections, 
containing between three and six papers each. About half the papers are original 
research reports, the rest are survey articles by well-known workers. Among the 
topics covered are motivation, programmed learning, problem solving, individual 
differences and learning in a social setting. There are no papers purely on animal 
learning and all have a clear application for the classroom. Very often papers under 
the same heading are written from different viewpoints as, for example, when Burt, 
Guilford and Nancy Bayley all write under Intelligence. 

It is difficult to evaluate books of readings, except with reference to specific 
courses but, for what it is worth, this reviewer would find Jenkins and Paterson a 
useful adjunct to his book list for students, De Cecco rather less so, and Trapp and 
Himelstein much less relevant. They are all worth a place in college, department and 
institute of education libraries. L. B. BIRCH. 


STUART, A. (1962). Basic Ideas of Scientific Sampling. Charles Griffin and Co., 
pp. 99, 18s. 


The first standard works on sample surveys, such as those of Yates and Deming, 
appeared between ten and fifteen years ago. Admirable as these books are, they are 
rather too specialised and technical to be easily absorbed by many psychologists 
and teachers who need to know the basic principles of sampling. General texts in 
statistics, on the other hand, usually limit themselves to inference from simple 
random samples, and this kind of sample is nearly always impracticable in 
educational research. There has thus been a need for a clear and simple introduction to 
such methods as cluster sampling, in which the sampling unit may be a class or a 
school instead of an individual, since relatively few of the research workers who use 
samples of this kind are completely informed about the principles involved and the 
methods of analysis required. 

Mr. Stuart’s monograph should help to bridge this gap. It is succinct, clear, and 
demands as little mathematical knowledge as any work on a statistical topic can do. 
There are frequent numerical examples, and these are kept beautifully simple. It is 
not, however, what the Americans call a ‘ cook-book,’ meaning one that provides 
rules-of-thumb for the commonest situations without much attention to theory. 
The emphasis throughout is theoretical, few formulae are provided, and the main 
aim is clearly, as suggested by the title, to familiarise readers with the basic logic 
rather than with the computational complexities of the various sampling processes. 

This aim is successfully achieved within the limited compass of the book. There 
are forty-four sections, averaging two or three pages each, of which the first nineteen 
deal mainly with general topics such as estimation, the normal distribution and 
confidence intervals. The last but one goes as far as sampling with probabilities 
proportional to sizes in cluster and two-stage samples. 

Within his self-imposed limits, the author has provided what is probably the 
clearest available exposition of sampling theory to the non-mathematician, and one 
that can be studied with benefit by the majority of people doing research in psy- 
chology and education. H. J. BUTCHER. 


WALL, W. D., SCHONELL, F. J., and OLSON, W. C. (1962). Failure in School. 
Unesco Institute for Education, Hamburg, pp. 158, 11s. 


This report is, regrettably, the final one of the series of expert international 
studies sponsored by Unesco during the last decade. Although the main discussions 
which culminated in this volume took place in 1956, the book strives to be up-to- 
date. Many references are made to recent publications, and the current views 
of psychologists on school failure and aspects of child development are lucidly if 
tersely explained. The authors make it clear that they are concerned not only with 
backward pupils and failure at school but with the failure of schools and the 
backwardness of educational systems and practices. 

Dr. Wall contributes four valuable chapters, in which he defines the problem, 
discusses the incidence and consequences of failure, and inquires into the socio- 
economic, familial and scholastic factors associated with educational retardation. 
He illuminates his theme with some useful figures, making comparisons between 
different countries and systems, while not failing to point out the difficulties in the 
way of obtaining full and reliable information as well as of interpreting the available 
data. His clear discussions of the significance of discrepancies between ability and 
attainment (Chapter II) and, in Chapter IV, of the effect of different levels of 
parental encouragement on attainment (and, indeed, on measured ‘g’) are of 
particular interest. 

Professor Olson's single chapter, dealing with aspects of child growth, is excel- 
lent. He produces evidence to support his contention that achievement levels must 
be viewed in the light of a child's total development, and that much that is called 
'failure' is not genuine child failure, but failure administratively defined. The 
range of individual differences is so wide that the use of class averages as a yard- 
stick is arbitrary and misleading. Like Dr. Tanner in this country, he stresses that 
chronological age is an unreliable index to a child's educational status or readiness for 
learning. 

Although there is little new in Professor Schonell's two chapters, the first is a 
guide to child study that teachers and students in training will find extremely 
helpful, and the second a succinct account of possible ways of organizing remedial 
measures to help the failing child. 

While a report of this kind is bound to lack depth in some places, it will help to 
clarify thought on school failure and can be recommended to all educationalists, 
including those who are responsible for the organization of education. There is a 
comprehensive bibliography which covers publications in eight languages; the price 
is reasonable but the binding flimsy. M. CHAZAN. 